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Error Detecting and Error Correcting Codes 
By R. W. HAMMING 


1. INTRODUCTION 


HE author was led to the study given in this paper from a considera- 

tion of large scale computing machines in which a large number of 
operations must be performed without a single error in the end result. This 
problem of ‘‘doing things right” on a large scale is not essentially new; in a 
telephone central office, for example, a very large number of operations are 
performed while the errors leading to wrong numbers are kept well under 
control, though they have not been completely eliminated. This has been 
achieved, in part, through the use of self-checking circuits. The occasional 
failure that escapes routine checking is still detected by the customer and 
will, if it persists, result in customer complaint, while if it is transient it will 
produce only occasional wrong numbers. At the same time the rest of the 
central office functions satisfactorily. In a digital computer, on the other 
hand, a single failure usually means the complete failure, in the sense that 
if it is detected no more computing can be done until the failure is located 
and corrected, while if it escapes detection then it invalidates all subsequent 
operations of the machine. Put in other words, in a telephone central office 
there are a number of parallel paths which are more or less independent of 
each other; in a digital machine there is usually a single long path which 
passes through the same piece of equipment many, many times before the 
answer is obtained. 

In transmitting information from one place to another digital machines 
use codes which are simply sets of symbols to which meanings or values are 
attached. Examples of codes which were designed to detect isolated errors 
are numerous; among them are the highly developed 2 out of 5 codes used 
extensively in common control switching systems and in the Bell Relay 
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Computers,' the 3 out of 7 code used for radio telegraphy,? and the word 
count sent at the end of telegrams. 

In some situations self checking is not enough. For example, in the Model 
5 Relay Computers built by Bell Telephone Laboratories for the Aberdeen 
Proving Grounds,' observations in the early period indicated about two 
or three relay failures per day in the 8900 relays of the two computers, repre- 
senting about one failure per two to three million relay operations. The self- 
checking feature meant that these failures did not introduce undetected 
errors. Since the machines were run on an unattended basis over nights and 
week-ends, however, the errors meant that frequently the computations 
came to a halt although often the machines took up new problems. The 
present trend is toward electronic speeds in digital computers where the 
basic elements are somewhat more reliable per operation than relays. How- 
ever, the incidence of isolated failures, even when detected, may seriously 
interfere with the normal use of such machines. Thus it appears desirable 
to examine the next step beyond error detection, namely error correction. 

We shall assume that the transmitting equipment handles information 
in the binary form of a sequence of 0’s and 1’s, This assumption is made 
both for mathematical convenience and because the binary system is the 
natural form for representing the open and closed relays, flip-flop circuits, 
dots and dashes, and perforated tapes that are used in many forms of com- 


munication. Thus each code symbol will be represented by a sequence of 
0’s and 1’s. 


The codes used in this paper are called systematic codes. Systematic codes 
may be defined* as codes in which each code symbol has exactly » binary 
digits, where m digits are associated with the information while the other 
k = n — m digits are used for error detection and correction. This produces 
a redundancy R defined as the ratio of the number of binary digits used to 
the minimum number necessary to convey the same information, that is, 


R = n/m. 


This serves to measure the efficiency of the code as far as the transmission 
of information is concerned, and is the only aspect of the problem discussed 
in any detail here. The redundancy may be said to lower the effective channel 
capacity for sending information. 

The need for error correction having assumed importance only recently, 
very little is known about the economics of the matter. It is clear that in 

Franz Alt, “A Bell Telephone Laboratories’ Computing Machine”—I, II. Mathe 

matical Tables and Other Aids to Computation, Vol. 3, pp. 1-13 and 60-84, Jan. and 
\pr. 1948 

2S. Sparks, and R. G. Kreer, “Tape Relay System for Radio Telegraph Operation,” 
R.C.A. Review, Vol. 8, pp. 393-426, (especially p. 417), 1947. 

§In Section 7 this is shown to be equivalent to a much weaker appearing definition. 
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using such codes there will be extra equipment for encoding and correcting 
errors as well as the lowered effective channel capacity referred to above. 
Because of these considerations applications of these codes may be expected 
to occur first only under extreme conditions. Some typical situations seem 
to be: 
a. unattended operation over long periods of time with the minimum of 
standby equipment. 
b. extremely large and tightly interrelated systems where a sing 
incapacitates the entire installation. 
*. signaling in the presence of noise where it is either impossible or un- 


1 


e failure 


economical to reduce the effect of the noise on the signal. 
These situations are occurring more and more often. The first two are par- 
ticularly true of large scale digital computing machines, while the third 
occurs, among other places, in “‘jamming”’ situations. 

The principles for designing error detecting and correcting codes in the 
cases most likely to be applied first are given in this paper. Circuits for 
implementing these principles may be designed by the application of well- 
known techniques, but the problem is not discussed here. Part I of the paper 
shows how to construct special minimum redundancy codes in the follow- 
ing cases: 

a. single error detecting codes 

b. single error correcting codes 

c. single error correcting plus double error detecting codes. 

Part II discusses the general theory of such codes and proves that under 
the assumptions made the codes of Part I are the “best’’ possible. 


PART I 
SPECIAL CODES 


2. SINGLE ERROR DETECTING CODES 


We may construct a single error detecting code having m binary digits 


in the following manner: In the first » — 1 positions we put » — 1 digits of 
information. In the n-th position we place either 0 or 1, so that the entire 
positions have an even number of 1’s. This is clearly a single error detecting 
code since any single error in transmission would leave an odd number of 
1’s in a code symbol. 

The redundancy of these codes is, since m = n — 1, 


It might appear that to gain a low redundancy we should let » become very 
large. However, by increasing n, the probability of at least one error in a 
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symbol increases; and the risk of a double error, which would pass unde- 
tected, also increases. For example, if p < 1 is the probability of any error, 
then for n so large as 1/p, the probability of a correct symbol is approxi- 
mately 1/e = 0.3679..., while a double error has probability 1/2e = 
O:3859..... 

The type of check used above to determine whether or not the symbol 
has any single error will be used throughout the paper and will be called 
a parity check. The above was an even parity check; had we used an odd 
number of 1’s to determine the setting of the check position it would have 
been an odd parity check. Furthermore, a parity check need not always 
involve all the positions of the symbol but may be a check over selected posi- 
tions only. 


3. SINGLE ERROR CORRECTING CODES 


To construct a single error correcting code we first assign m of the » avail- 
able positions as information positions. We shall regard the m as fixed, but 
the specific positions are left to a later determination. We next assign the k 
remaining positions as check positions. The values in these & positions are 
to be determined in the encoding process by even parity checks over selected 
information positions. 

Let us imagine for the moment that we have received a code symbol, with 
or without an error. Let us apply the & parity checks, in order, and for each 
time the parity check assigns the value observed in its check position we 
write a 0, while for each time the assigned and observed values disagree 
we write a 1. When written from right to left in a line this sequence of k 0’s 
and 1’s (to be distinguished from the values assigned by the parity checks) 
may be regarded as a binary number and will be called the checking number. 
We shall require that this checking number give the position of any single 
error, with the zero value meaning no error in the symbol. Thus the check 
number must describe m + k + 1 different things, so that 


k 
2>m+k+i1 
is a condition on &. Writing n = m + k we find 


yn 
m 2 
2+1° 

Using this inequality we may calculate Table I, which gives the maximum 
m for a given n, or, what is the same thing, the minimum » for a given m. 

We now determine the positions over which each of the various parity 
checks is to be applied. The checking number is obtained digit by digit, 
from right to left, by applying the parity checks in order and writing down 
the corresponding 0 or 1 as the case may be. Since the checking number is 
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TABLE | 


m Corresponding k 





9 
10 
11 


11 
Etc. 


to give the position of any error in a code symbol, any position which has 
a 1 on the right of its binary representation must cause the first check to 
fail. Examining the binary form of the various integers we find 


= 1 
= ‘jl 
101 
= iil 
= 1001 
Etc. 


have a 1 on the extreme right. Thus the first parity check must use positions 


a 


In an exactly similar fashion we find that the second parity check must 
use those positions which have 1’s for the second digit from the right of their 
binary representation, 


10 

11 

110 

i= iil 

1010 

if = 300! 
Etc., 
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the third parity check 


100 
Sm F0i 
6= 110 
7= ii! 
iz 1100 
13 1101 
14 1110 
i9= 1111 
20 = 10100 

Etc. 


It remains to decide for each parity check which positions are to contain 
information and which the check. The choice of the positions 1, 2, 4, 8, --- 
for check positions, as given in the following table, has the advantage of 
making the setting of the check positions independent of each other. All 
other positions are information positions. Thus we obtain Table II. 


TABLE II 


Check Number | Check Positions Positions Checked 


5. 7.9. 08.48.35, W7,->- 
2, 3, 6, 7, 10, 11, 14, 15, 18,--- 
4,5, 6, 7, 12, 13, 14, 15, 20,--- 
8,9, 10, 11, 12, 13, 14, 15, 24,--- 


Pe F 


As an illustration of the above theory we apply it to the case of a seven- 
position code. From Table I we find for nm = 7, m = 4 and k = 3, From 
Table II we find that the first parity check involves positions 1, 3, 5, 7 and 
is used to determine the value in the first position; the second parity check, 
positions 2, 3, 6, 7, and determines the value in the second position; and 
the third parity check, positions 4} 5, 6, 7, and determines the value in posi- 
tion four. This leaves positions 3, 5, 6, 7 as information positions. The results 
of writing down all possible binary numbers using positions 3, 5, 6, 7, and 
then calculating the values in the check positions 1, 2, 4, are shown 
in Table III. 

Thus a seven-position single error correcting code admits of 16 code sym- 
bols. There are, of course, 2’ — 16 = 112 meaningless symbols. In some ap- 
plications it may be desirable to drop the first symbol from the code to 
avoid the all zero combination as either a code symbol or a code symbol plus 
a single error, since this might be confused with no message. This would still 
leave 15 useful code symbols. 
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TABLE III 


Position : e 2 
Decimal Value of 
Symbol 


As an illustration of how this code ‘‘works” let us take the symbol 
0 111 1 0 O corresponding to the decimal value 12 and change the 1 in 
the fifth position to a 0. We now examine the new symbol 


0111000 
by the methods of this section to see how the error is located. From Table I] 
the first parity check is over positions 1, 3, 5, 7 and predicts a 1 for the first 
position while we find a 0 there; hence we write a 
1 

The second parity check is over positions 2, 3, 6, 7, and predicts the second 
position correctly; hence we write a 0 to the left of the 1, obtaining 

01 
The third parity check is over positions 4, 5, 6, 7 and predicts wrongly; hence 
we write a 1 to the left of the 0 1, obtaining 

101 
This sequence of 0’s and 1’s regarded as a binary number is the number 5; 
hence the error is in the fifth position. The correct symbol is therefore ob 
tained by changing the 0 in the fifth position to a 1. 


4. SINGLE ERROR CORRECTING PLus DouBLeE ERROR DETECTING CODES 


To construct a single error correcting plus double error detecting code we 
begin with a single error correcting code. To this code we add one more posi- 
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tion for checking all the previous positions, using an even parity check. ‘To 
see the operation of this code we have to examine a number of cases: 

1. No errors. All parity checks, including the last, are satisfied. 

2. Single error. The last parity check fails in all such situations whether 
the error be in the information, the original check positions, or the last 
check position. The original checking number gives the position of the 
error, where now the zero value means the last check position. 

Two errors. In all such situations the last parity check is satisfied, and 
the checking number indicates some kind of error. 

As an illustration let us construct an eight-position code from the previous 
seven-position code. To do this we add an eighth position which is chosen 
so that there are an even number of 1’s in the eight positions. Thus we add 
an eighth column to Table III which has: 


TABLE IV 


0 
0 
1 


PART II 
GENERAL THEORY 
5. A GEOMETRICAL MODEL 


When examining various problems connected with error detecting and 
correcting codes it is often convenient to introduce a geometric model. 
The model used here consists in identifying the various sequences of 0’s and 
1’s which are the symbols of a code with vertices of a unit #-dimensional 
cube. The code points, labelled x, y, z,--- , form a subset of the set of all 
vertices of the cube. 


Into this space of 2" points we introduce a disfance, or, as it is usually 
called, a metric, D(x, y). The definition of the metric is based on the observa- 


tion that a single error in a code point changes one coordinate, two errors, 
two coordinates, and in general d errors produce a difference in d coordinates. 
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Thus we define the distance D(x, y) between two points x and y as the num- 
ber of coordinates for which x and y are different. This is the same as the 
least number of edges which must be traversed in going from x to y. This 
distance function satisfies the usual three conditions for a metric, namely, 


D(x, y) = 0 if and only if x = y 
D(x, y) = Dy, x) > 0 ifx Ay 
Diz, vy) + Dy, z) = D(x, z) (triangle inequality). 


As an example we note that each of the following code points in the three- 
dimensional cube is two units away from the others, 


001 
010 
100 
i & FF 


To continue the geometric language, a sphere of radius r about a point x 
is defined as all points which are at a distance r from the point x. Thus, in 
the above example, the first three code points are on a sphere of radius 2 
about the point (1, 1, 1). In fact, in this example any one code point may be 
chosen as the center and the other three will lie on the surface of a sphere 
of radius 2. 

If all the code points are at a distance of at least 2 from each other, then it 
follows that any single error will carry a code point over to a point that is 
not a code point, and hence is a meaningless symbol. This in turn means that 
any single error is detectable. If the minimum distance between code points 
is at least three units then any single error will leave the point nearer to the 


correct code point than to any other code point, and this means that any 


single error will be correctable. This type of information is summarized in 
the following table: 


TABLE V 


Minimum Meaning 
Distance 


uniqueness 

single error detection 

single error correction 

single error correction plus double error detection 
double error correction 


Etc 


Conversely, it is evident that, if we are to effect the detection and correc- 
tion listed, then all the distances between code points must equal or exceed 
the minimum distance listed. Thus the problem of finding suitable codes is 
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the same as that of finding subsets of points in the space which maintain at 
least the minimum distance condition. The special codes in sections 2, 3, 
and 4 were merely descriptions of how to choose a particular subset of points 
for minimum distances 2, 3, and 4 respectively. 

It should perhaps be noted that, at a given minimum distance, some of 
the correctability may be exchanged for more detectability. For example, a 
subset with minimum distance 5 may be used for: 

a. double error correction, (with, of course, double error detection). 

b. single error correction plus triple error detection. 

c. quadruple error detection. 

Returning for the moment to the particular codes constructed in Part I 
we note that any interchanges of positions in a code do not change the code 
in any essential way. Neither does interchanging the 0’s and 1’s in any posi- 
tion, a process usually called complementing. This idea is made more precise 
in the following definition: 

Definition. Two codes are said to be equivalent to each other if, by a finite 
number of the following operations, one can be transformed into the other: 

1. The interchange of any two positions in the code symbols. 

2. The complementing of the values in any position in the code symbols. 
This is a formal equivalence relation (~) since A ~ A; A ~ B implies 
B~ A;and A ~ B, B~C implies A ~ C. Thus we can reduce the study 
of a class of codes to the study of typical members of each equivalence class. 

In terms of the geometric model, equivalence transformations amount to 
rotations and reflections of the unit cube. 


6. SINGLE ERROR DETECTING CODES 


The problem studied in this section is that of packing the maximum num- 
ber of points in a unit n-dimensional cube such that no two points are closer 
than 2 units from each other. We shall show that, as in section 2, 2" points 
can be so packed, and, further, that any such optimal packing is equivalent 
to that used in section 2. 

To prove these statements we first observe that the vertices of the »- 
dimensional cube are composed of those of two (” — 1)-dimensional cubes. 
Let A be the maximum number of points packed in the original cube. Then 
one of the two (x — 1)-dimensional cubes has at least 4/2 points. This cube 
being again decomposed into two lower dimensional cubes, we find that one 
of them has at least 4/2? points. Continuing in this way we come to a two- 
dimensional cube having A/2"~” points. We now observe that a square can 
have at most two points separated by at least two units; hence the original 


n-dimensional cube had at most 2"! points not less than two units apart. 
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To prove the equivalence of any two optimal packings we note that, if 
the packing is optimal, then each of the two sub-cubes has half the points. 
Calling this the first coordinate we see that half the points have a 0 and half 
have a 1. The next subdivision will again divide these into two equal groups 
having 0’s and 1’s respectively. After ( — 1) such stages we have, upon re- 
ordering the assigned values if there be any, exactly the first » — 1 positions 
of the code devised in section 2. To each sequence of the first » — 1 coordi- 
nates there exist m -- 1 other sequences which differ from it by one co- 
ordinate. Once we fix the n-th coordinate of some one point, say the origin 
which has all 0’s, then to maintain the known minimum distance of two 
units between code points the m-th coordinate is uniquely determined for all 
other code points. Thus the last coordinate is determined within a comple- 
mentation so that any optimal code is equivalent to that given in section 2. 

It is interesting to note that in these two proofs we have used only the 
assumption that the code symbols are all of length n. 


7. SINGLE ERROR CORRECTING CODES 


It has probably been noted by the reader that, in the particular codes of 
Part I, a distinction was made between information and check positions, 
while, in the geometric model, there is no real distinction between the various 
coordinates. To bring the two treatments more in line with each other we re- 
define a systematic code as a code whose symbol lengths are all equal and 

1. The positions checked are independent of the information contained 

in the symbol. 

2. The checks are independent of each other. 

3. We use parity checks. 

This is equivalent to the earlier definition. To show this we form a matrix 
whose i-th row has 1’s in the positions of the 7-th parity check and 0’s else 
where. By assumption 1 the matrix is fixed and does not change from code 
symbol to code symbol. From 2 the rank of the matrix is &. This in turn 
means that the system can be ,olved for & of the positions expressed in 
terms of the other » —. k& positions. Assumption 3 indicates that in this 
solving we use the arithmetic in which 1 + 1 = 0. 

There exist non-systematic codes, but so far none have been found which 
for a given » and minimum distance d have more code symbols than a sys- 
tematic code. Section 9 gives an example of a non-systematic code. 

Turning to the main problem of this section we find from Table V that a 
single error correcting code has code points at least three units from each 
other. Thus each point may be surrounded by a sphere of radius 1 with no 
two spheres having a point in common. Each sphere has a center point and 
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n points on its surface, a total of m + 1 points. Thus the space of 2” points 
can have at most: 
yn 
n+ 1 
spheres. This is exactly the bound we found before in section 3. 

While we have shown that the special single error correcting code con- 
structed in section 3 is of minimum redundancy, we cannot show that all 
optimal codes are equivalent, since the following trivial example shows that 
this is not so. For nm = 4 we find from Table I that m = 1 and k = 3. Thus 
there are at most two code symbols in a four-position code. The following 
two optimal codes are clearly not equivalent: 


O000 ; O00 0 
Ss ee SG ke 


8. SINGLE ERROR CORRECTING PLUS DouBLE ERROR DETECTING CODES 


In this section we shall prove that the codes constructed in section 4 are 
of minimum redundancy. We have already shown in section 4 how, for a 
minimum redundancy code of n — 1 dimensions with a minimum distance 
of 3, we can construct an » dimensional code having the same number of 
code symbols but with a minimum distance of 4. If this were not of minimum 
redundancy there would exist a code having more code symbols but with 
the same m and the same minimum distance + between them. Taking this 
code we remove the last coordinate. This reduces the dimension from n to 
n — 1 and the minimum distance between code symbols by, at most, one 
unit, while leaving the number of code symbols the same. This contradicts 
the assumption that the code we began our construction with was of mini- 
mum reduncancy. Thus the codes of section 4 are of minimum redundancy. 

This is a special case of the following general theorem: To any minimum 
redundancy code of V points in  — 1 dimensions and having a minimum 
distance of 2k — 1 there corresponds a minimum redundancy code of .V 
points in 2 dimensions having a minimum distance of 2k, and conversely. 
To construct the » dimensional code from the » — 1 dimensional code we 
simply add a single n-th coordinate which is fixed by an even parity check 
over the m positions. This also increases the minimum distance by 1 for 
the following reason: Any two points which, in the 2 — 1 dimensional code, 
were at a distance 2k — 1 from each other had an odd number of differences 
between their coordinates. Thus the parity check was set oppositely for the 
two points, increasing the distance between them to 2k. The additional co- 


ordinate could not decrease any distances, so that all points in the code are 


now at a minimum distance of 2k. To go in the reverse direction we simply 
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drop one coordinate from the » dimensional code. This reduces the minimum 
distance of 2k to 2k — 1 while leaving V the same. It is clear that if one 
code is of minimum redundancy then the other is, too. 
9, MISCELLANEOUS OBSERVATIONS 
For the next case, minimum distance of five units, one can surround each 
code point by a sphere of radius 2. Each sphere will contain 
1+ C(m, 1) + C(n, 2) 


points, where C(n, &) is the binomial coefficient, so that an upper bound on 
the number of code points in a systematic code is 


>” pnt 


~ 


1+ C(n,1)+C(n, 2) wt+tn+2 


a sa 

This bound is too high. For example, in the case of m = 7, we find that 
m = 2 so that there should be a code with four code points. The maximum 
possible, as can be easily found by trial and error, is two. 

In a similar fashion a bound on the number of code points may be found 
whenever the minimum distance between code points is an odd number. 
A bound on the even cases can then be found by use of the general theorem 
of the preceding section. These bounds are, in general, too high, as the above 
example shows. 

If we write the bound on the number of code points in a unit cube of dimen- 
sion ” and with minimum distance d between them as B(n, d), then the 
information of this type in the present paper may be summarized as follows: 


Bin, 
B(n, 


Bi(n, ; 


B(n, 4) 


n 


Bln — 1, 2k — = B(n, 2k) 


B(n, 2k — 1) = 2™ << 
—~ 1+ Cin, 1) + --- + C(n, k — 1) 
While these bounds have been attained for certain cases, no general 
methods have yet been found for contructing optimal codes when the mini- 
mum distance between code points exceeds four units, nor is it known 
whether the bound is or is not attainable by systematic codes. 
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We have dealt mainly with systematic codes. The existence of non-sys- 
tematic codes is proved by the following example of a single error correcting 
code with n = 6. 


000000 
010101 
100110 
111000 
O10 4-4 
111111 


The all 0 symbol indicates that any parity check must be an even one. 
The all 1 symbol indicates that each parity check must involve an even num- 
ber of positions. A direct comparison indicates that since no two columns 
are the same the even parity checks must involve four or six positions. An 
examination of the second symbol, which has three 1’s in it, indicates that 
no six-position parity check can exist. Trying now the four-position parity 
checks we find that 


‘fe 


are two independent parity checks and that no third one is independent of 
these two. Two parity checks can at most locate four positions, and, since 
there are six positions in the code, these two parity checks are not enough 
to locate any single error. The code is, however, single error correcting since 
it satisfies the minimum distance condition of three units. 

The only previous work in the field of error correction that has appeared 
in print, so far as the author is aware, is that of M. J. E. Golay.‘ 


'M. J. E. Golay, Correspondence, Notes on Digital Coding, Proceedings of the I.R.E., 
Vol. 37, p. 657, June 1949. 





Optical Properties and the Electro-optic and Photoelastic 
Effects in Crystals Expressed in Tensor Form 


By W. P. MASON 


I. INTRODUCTION 


HE electro-optic and photoelastic effects in crystals were first investi- 

gated by Pockels,! who developed a phenomenological theory for these 
effects and measured the constants for a number of crystals. Since then not 
much work has been done on the subject till the very large electro-optic 
effects were discovered in two tetragonal crystals ammonium dihydrogen 
phosphate (ADP) and potassium dihydrogen phosphate (KDP). With these 
crystals light modulators can be obtained which work on voltages of 2000 
volts or less. Their use has been suggested? in such equipment as light valves 
for sound on film recording and in television systems. Furthermore, since 
the electro-optic effect depends on a change in the dielectric constant with 
voltage, and the dielectric constant is known to follow the field up to 10” 
cycles, it is obvious that this effect can be used to produce very short light 
pulses which may be of interest for physical investigations and for strobo- 
scopic instruments of very high resolution. Hence these crystals renew an 
interest in the electro-optic effect. 

In looking over the literature on the electro-optic effect and photoelastic 
effect in crystals, there do not seem to be any derivations that give them 
in terms of thermodynamic potentials, which allow one to investigate the 
condition under which equalities occur between the various electro-optic 
and photoelastic cynstants. Hence it is the purpose of this paper to give such 
a derivation. Another object is to give a derivation of Maxwell’s equations 
in tensor form, and to apply them to the derivation of the Fresnel ellipsoid. 

The first sections deal with the optics of crystals, and derive the Fresnel 
ellipsoid from Maxwell’s equations. Other sections give a derivation of the 
two effects, discuss methods for measuring them by determining the bi- 
refrigence in various directions and give the constants for the two effects in 
terms of crystal symmetries. The final section discusses the application of 
the photoelastic effect for measuring strains in isotropic media. 

1 F, Péckels, Lehrbuck Der Kristalloptic, B. Teubner, Leipzig, 1906. 

2See Patent 2,467,325 issued to the writer; “Light Modulation by P type Crystals,” 


George D. Gotschall, Jour. Soc. Motion Picture Engineers, July, 1948, pp. 13-20; B. H 
Billings, Jour. Opt. Soc. Am., 39, 797, 802 (1949). 
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II. SOLUTION OF MAXWELL’S Equations IN TENSOR FORM 


In tensor notation, Maxwell’s equations for a nonmagnetic medium with 
no free charges take the form 
1 aD; oH; 1 0H; IE, 0H; 
— 5 ae Oe iS = 0; = = 0 (1) 
V al OX}, V at x OX; 
where D; is the electric displacement, H; the magnetic field, EZ; the electric 
field, V the velocity of light in vacuo and €,; a tensor equal to zero when 
i= jorkorj = k, but equal to 1 or —1 when all three numbers are different. 
If the numbers are in rotation, i.e. 1, 2, 3; 2, 3, 1; 3, 1, 2 the value is +1 
while, if they are out of rotation, the value is —1. 
We assume the electric vector to be representable by a plane wave whose 
planes of equal phase are taken normal to the unit vector n; . Then 


ie = Eo, gvtimrini «) a 


where /», are constants representing the maximum values of the field along 
the three rectangular coordinates and 7 = Y —1. Substituting (2) in the 
second of equations (1), noting that o, are not functions of the space co- 
ordinates, we have 

10H; jw 


V at cag ej 1 Eo, nile 


jwlt—xjny/r 


Integrating with respect to the time 


| jwlt—x jn 


. i/v) w(t{—zryng/v 
H; —_ [ejni Fo, nile aoe Hye ; 1 . 


Hence, 
J 


¢ 


lejxi Eo, nil 


Ho, 


and therefore the magnetic vector is normal to the plane determined by 
Eo, and n;. 
Next, using the first of equations (1), 
‘ P 9 jwlt—rzpn,/v) 
oD, OH; , "i acl 
— € i ik -_ = Ve xj Hy, 


al OX; OX; 

; (6) 
jw 

cf 


jwlt—rpny/v) 
€ ijk Ho; nile ° 
Integrating with respect to time, 


D, ve a lesin Ho, mle" TMK! 


¢ 
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Inserting the value of Ho; from (5), this equation takes the form 


2 


jelt—z;nj/v) 


dD; se ~ ae [esse (€jxs Eo, ni)nele 


and, in general, 


"i . 
dD; = —-- lesje(€ jes ky ni) nx. 


» 
qe 


Expanding the inner parenthesis, we have the components 
(Kons = E3n2)1; (E3ny —_ FE\n3)2; (Eyez on Fon). 
Then 


€:;«((Fen3 — Exne); (Eym, — Eyns); (Evm2 — Exm)|ne gives 


aa. 
— [((E3m — Eyns3)n3 — (Eyn, — Em) ne] 


= [(E3n3 + Eom, + Eym)m — E\(ni + nz + n3)] 


V : . : 7 
——|(E\n. — E.m)m — (E.n3 — E3nz)ns] 


a2 


= [(E3n3 + Eon, + Eym)nz, — Ex(ni + nz + n3)] 
V 


9 


(Eons — E3n2)n2 — (Es3m — Eyns)m) 
= [(E3n3 + Eom, + Er.m)ns — Ex(n; + oe + ak 


Now, since 1? + n} + nj = 1 because 1 is a unit vector, we have 


A [E; — (Ej; nj)ni] or z D; — Ey — (Ej nj)ny = 0. (12) 


9 
”) 
¢ 


This equation states that D;, E; and n; are in the same plane, H; being 
normal to the plane as shown by Fig. 1. The energy flow vector 


J 
S; = €ijx Ej Hy (13) 
4n 


also lies in the plane since it is perpendicular to E and H. It is at the same 
angle @ with that E is with D. The velocity of energy flow is v/cos 6. The 
energy velocity is called the ray velocity and the energy path the ray path. 


Next, from the relation for a material medium, that 


D; = K,;E; or conversely E; = 8;:D, (14) 
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where K;; are the dielectric constants measured at optical frequencies and 
8;; are the impermeability constants determined from the relations 

By: = A*/a* (15) 


where 


i 
A* = | Ki2 
Ki 


:, th th 
and A” the determinant obtained by suppressing the 7" row and i column, 
we can eliminate £; from equation (12) and obtain 


- Dy = BuDi + Bie D2 + Bis D3 — (E;n;)ny 


— D: = B12 Di + B22 D2 + B23 Ds — (Ej nj)ne 
ri Ds = BD: + BurDs + Bas Ds — (Ejnj)m, 
This can be put in the form 
(Eynj)m = D[Bu — v/V?] + BeDe + BisD3 
(Ejn;)n2 = BD, + (B22 — v*/V?)D2 + Be3Ds 
(Eyn;)n3 = B13), + Bo3D2 + (B33 — v®/V?) Ds. 
Solving for D; , D. and D, 
D, = [(Bx2 — v°/V*)(B33 — v°/V2) — Bos/[Ejn,]m 
D2 = [(Bu — 0°/V?)(B33 — 2°/V2) — Bisl[E,n,]me 
Ds; = [(Bu — 2°/V2)(B22 — v*/V2) — Bie|[Ejny]ns. 
Now, since D and are at right angles, 
Din, + Dong + Dong = 0. 
Hence, 
0 = [(Boe2 — v?/V?)(B33 — v2/V2) — Bos|ni 
+ [(Bu — v2/V2)(B35 — v°/V2) — Bis|n) (20) 


PS) Bralite, 
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Fig. 1—Position of electric, magnetic and normal vectors for an electromagnetic plane 
wave in a crystal. 











x 
Fig. 2—Rotated axes and angles for relating them to unrotated axes 


Be; = Oand using 


By choosing the original x, y, z axes so that By = Bis 
Be , 833 = B; this gives the equation 


the values 6, = 81, G2 = 
ny n3 
- = (, 


nz; = O and the two velocities 


For transmission along the X 


are given by 
rs -2 ( ??) 
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Similarly the third velocity v? = 6,V? = a? can also be used and equation 
(21) reduces to 


») € 9 
ny n nN 
+ +. + —— 


e— # F— # 


2 a 
= 


This is a quadratic equation for the velocities v in terms of the principal 
velocities a, 6 and ¢ which are usually taken so that a > b > c. 
Solving for the velocities, we obtain the quadratic equation 


v— o[n3(b? + 2) + nia? + 2) + nat b*)| 
: . ; (24) 
+ mb? + nae + nab? = 0. 
Letting L = nji(b? — @), M = ni(c2 — a2), N = n3(a? — 82) the solutions 
for the velocities become 


20” = nilb- + ¢) of nile + @) ob nila” of b°) 
+ V/L? + M?+ N? — 2LM — 2LN — 2MN- 


25) 
This equation can be put into a simpler form if we change to the coordinate 
system shown by Fig. 2. Here the rotated system is related to the original 
system by three angles 6, 9, y. 6 is the angle between the Z’ axis and the 
Z axis, ¢ is the angle the plane containing Z and Z’ makes with the X axis 
while y represents a rotation of the primed coordinate systems about the 


op . rh . . . ° . . 
Z axis. The direction cosines for the primed system with respect to the 
normal system are designated by the matrix 


el ae 
X’| 4; m, n 
Y’ | lo me Me 
Z’ t; Ms N3 

where, in terms of 6, ¢ and y, these direction cosines are, 

{; = cos cos g cos Wy — sin ¢g sin y, 

m, = cos @sin ¢ cosy + cos ¢ sin y, nm, = — sin @ cosy 
f2 = —cos@cos ¢gsinw — sin ¢g cosy, 
Ms. = cos ¢ cos Y — sing siny cos 8, nm. = sin 6 sin y 
{; = cos ¢ sin 8, m; = sin ¢ sin @, nz; = cos 6. (27) 


If we take Z’ as the direction of the wave normal, then in equation (25) 


nm = (3, = m3, nz = Nz 
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and the equation for the velocities becomes 


= a (sin’ gsin’ 6 + cos 0) + b'(cos’ vg sin’ 6 + cos 6) + ¢ sin’ @ 
* ‘ig — b*)*(cos? 6 cos? g + sin? ¢)? + 2(a? — 8*)(c? — &) 


sin? 0(cos? 6 cos? g — sin? g) + (c? — 8)? sint g 


(28 


A very elegant construction for the wave-velocities and the directions of 
vibration is the Fresnel index ellipsoid. Consider the ellipsoid 


a*x? + By? + cz? = | (29) 


Then Fresnel’ showed that, for any diametral plane perpendicular to the 
wave normal, the two principal axes of the ellipse were the directions of the 
two permitted vibrations, while the wave velocities were the reciprocals of 
the principal semi-axes. 

We wish to show now that the maximum and minimum values of the im 
permeability constants in a plane perpendicular to the direction of the 
wave normal determine the directions of vibration and the values of the two 
velocities. To show this we make use of the fact that 8;; is a second rank 
tensor and transforms according to the tensor transformation formula 


OXE OXE 


where the partial derivatives are the direction cosines 


/ , 


/ ° 
Ox, OX, OX 
—— = 1, — 104, . 
Ox, OXe OX3 


“ / - , 
dxs OX: OX 
fs ’ _ 


Ox} OXe Ox; 
i / - / ™ / 
OX; OX3 OX3 
f3, : = M3, - 
Ox, 0X2 OX3 
Expanding equation (30) the six transformation equations become 
! 2 2 2 
Bur = GBu + 26m By + 261813 + miBe2 + 2mm Be; + 11833 
/ 
Bie = O681 + (fim + myl> )Bio + (¢\n2 oa ny fs )B13 + MMo3o0 
+ (mre + nymes)Bo, +- ny NeBa3 
/ 
Bi3 = GlsBu + (Gms + mils)Bie + (fins + mfs)B13 + mymsBn 
+- (nym; + mytt3)Ber + my 22833 


3See for example “Photoelasticity,” Coker and Filon, Cambridge University Press, 
pages 17 and 18 
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a!’ 2 2 P 2 
Boo = 281 + 26me2B12 + 262813 + moBo2 + 2meneBe3 + 2833 
/ 
Bo3 = fofsBu + (fms + mols)Bi2 + (43 + mols)Bis + memsBo 
+ (mens; + n2M3)Be3 + NoN3333 
4 2 é 2 P 2 
B33 = (38u + 2¢3msB12 + 2633813 + m3Be2 + 2m3n3823 + 13833. 
Now, if the axes refer to the axes of a Fresnel ellipsoid, 812 = 813 = B23 = 0 
° ege ° ° / 
and one of the impermeability constants for any direction, say 833 , can be 
expressed in the form 


/ 2 2 2 
B33 = £338; + m3Be + 1383 (32) 
If r, which lies along Z’ of Fig. 2, is the radius vector of the Fresnel ellipsoid, 
then the direction cosines f3 , m3 and 3 are 


kas y 


M3 = 
From equation (24) 6; = 


becomes 


217g° 22 2.2 22 
PV'bs = ae + By + ce = 1. 


° ° e " . . . tyr2y! 
Hence the square of the radius vector of the Fresnel ellipsoid is 1/V ‘833 


and the radius vector of the impermeability ellipsoid agrees with that of the 
Fresnel ellipsoid. Hence, the directions of vibration can be determined from 
the principal axes of the impermeability ellipsoid for any diametral plane. 
When light transmission occurs along Z’, the direction for maximum and 
. . . a . . / 
minimum impermeability can be obtained by evaluating 8), and deter- 
mining the angle y for which it has an extreme value. Inserting the direction 
cosines { , m, and m from equation (27), we find 
+ ) + ) “Nc 
’ 2 2 2 sin 2g sin 2y cos 8 eo 
Bit By cos @ cos gcos y — ; v + sing sin py 
sin 2 sin 2W cos 0 
) 


+ By} cos @sin’ yg cos py 4+ + cos gsin y 


+ Bs sin” 0 cos” vy. 


Differentiating with respect to y and setting the resultant derivative equal 
to zero, the value of y that will satisfy the equation is given by 


(Bo = B1) sin 20 cos @ 
(31 — 82) (cos? 6 cos? ¢ — sin® ¢) + (83 — Be) sin? é 


tan 2y 
(34) 


(b° — a’) sin 2¢ cos 0 
(a? — 6) (cos? @ cos? ¢ — sin? ¢) + (c? — 6?) sin? 6" 
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For a given value on the right-hand side there are two values of ¥, 90° apart, 
that will satisfy the equation and hence we have two directions of vibration 
at right angles to each other. Inserting (34) in (33) the values of Bi and 
B11 for these two directions are 


ae * 9 * 9 ° , ° ° 9 9 r * 9 
2811 = Bi(sin® ¢ sin? @+ cos? @) + B2(cos* ¢ sin? @ + cos? 6) + 8s sin? 


re (81 es, Be)? (cos? é cos” va) + sin? ¢)* + 2(8, — B2)(B3 ua B:) 
- sin? 6 (cos? 8 cos? ¢ — sin? ¢) + (83 — 82)? sin‘ @. 


Since 8; corresponds to a’, etc., this equation agrees with the two velocities 
given in equation (28) and shows that the directions of vibration correspond 
with the maximum and minimum values of Bn. 

It can also be shown that the two directions of electric displacement co- 
incide with the two values of y given by equation (34). Transforming the 
electrical displacements to the X’, Y’, Z’ set of axes we have 
dx 4, Om 


oF 
ax1 + 9 D3; : {,D, + m, De» a ny Ds, 


D. 
, Ox, Ox i T om x3 


D; - D,+ s Dz + — Onn D; £2D; + m2, Dz + mD; 


Ox 


D; - Di + = + = a = {;D; + m;D, + nD. 
v1 


Hence, inserting the values of D,, D2, D; from equation (18), we find 
/ Pus , , Fry , 
dD, = £1 65(Be = B11) (Bs coe B11) +. mym3(B, mi B11)(B3 _ B11) 
! 4 ; 
- 1yN3( By os B11) (Be - O41) 
, ae , ’ , , 
Dz = 5 3( Be —_ B11) (Bs _ B11) + moms(3, — G11) (D3 — (311 
Tr NeNt3(01 — By1)(B2 — By) 
9 / / : 9 , , 
(3( Be ae B11) (B3 — By) + m3(3, - B11)(B3 — B11) 
T n3(B, ~ B11) Peo O11). 


From equation (20) with Bi = 813; = 3 = 0, it is evident that the D; com 
ponent vanishes and hence the two values of electric displacement lie in a 
plane perpendicular to Z’. By inserting the values of By, and the value of 
y found from equation (34) we find that D, = 0 and hence the electric dis- 
placement lies along the directions of the greatest value of Bi . Similarly, 
from the second value of 81, , D; vanishes and hence the second wave is per- 
pendicular to the first and in the direction of the smallest value of Se. 
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III. Location oF Optic AXES IN A CRYSTAL 


When the expression in the radical of equation (28) vanishes the two 
velocities are equal and an optic axis exists. Since the expression inside the 
radical can be written 


((a2 — b*)(cos* dcos?y + sin? y) — (6? — c*)sin® OP 
(37) 
— 4(a? — b*)(c? — 8) sin? Osin?g = 0 
then,.since the square is always positive and since (a?7— 8?) > O and 
(b? — c*) > O, the equation can vanish only if g¢ = 0. But ¢ = 0 indicates 
that the two optic axes always lie in a plane perpendicular to the inter- 
mediate velocity 6. With ¢ = 0 then the square vanishes when 
tan’ 0 (a ze . or tan@é= + ee x. (38) 
(2 — ¢) 2— ¢ : 
If (a2 — b*) < (8 — c*) the value of the tan @ is less than unity and the 
crystal is called a positive crystal. For this case the two axes approach more 
closely the Z axis having the velocity c than they do the X axis. If 
(a2 — 8?) > (b? — c*) the crystal is negative. 
Ifa = bor b = ¢ the crystal has a single optic axis and is respectively a 
positive or negative uniaxial crystal. For the first case the two velocities 
are given by 


1 =a = 8, » = Va? cos? 6 + c sin? 6. (39) 


The first velocity is that of the ordinary ray while that of the second is that 
of the extraordinary ray. Since a > c, the ordinary ray will have a velocity 
greater than the extraordinary ray except along the optic axis where they 
are equal. Since c < a, the maximum axis for any ellipse, formed by inter- 
secting the Fresnel ellipsoid at an angle to the optic axis, will lie in the plane 
formed by the normal and the c axis and hence the direction of polarization 
of the extraordinary ray will lie in the c, 2 plane. The polarization of the 
ordinary ray will be perpendicular to this plane. 

If b = ¢ the a axis is the optic axis and the velocities of the two rays are 
again 


vw) = cand 2 = a’ (1-sin’ 6cos g) + ¢ (sin’ Ocos’ ¢) (40) 


Hence, when @= 90°, ¢ = 0°, the two velocities are equal and a is the optic 
axis. In this case the velocity of the extraordinary ray is greater than that 
of the ordinary ray except along the a axis, and the crystal is a negative 
uniaxial crystal. The polarization of the extraordinary ray lies again in the 
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plane of the normal and the optic axis while the ordinary ray is perpendicu- 
lar to it. 


IV. DERIVATION OF THE ELECTRO-OPTIC AND PHOTOELASTIC EFFECTS 


In a previous paper’ and in the book “Piezoelectric Crystals and Their 
Application to Ultrasonics’, D. Van Nostrand, 1950, it was shown that the 
electro-optic and photoelastic effects can be expressed as third derivatives 
of one of the thermodynamic potentials. Probably the most fundamental 
way of developing these properties is to express them in terms of the strains, 
electric displacements and the entropy. For viscoelastic substances it has 
been shown that the photoelastic effects are directly related to the strains. 
In terms of the electric displacements, the electro-optic constants do not 
vary much with temperature whereas, if they are expressed in terms of the 
fields, the constants of a ferroelectric type of crystal such as KDP increase 
many fold near the Curie temperature. The entropy is chosen as the funda- 
mental heat variable, since most measurements are carried out so rapidly 
that the entropy does not vary. 

The thermodynamic potential which has the strains, electric displace- 
ments and entropy as the independent variables is the internal energy U’, 
given by 


Dn 
dU = T;; Si; + Em = + Odo (41) 
T 


where S;; are the strains, 7; the stresses, E,,.the fields, D,, the electric dis- 
placements,@ the temperature and o the entropy. In this equation the 
strains S;; are defined in the tensor form 


“ 1 fou Ou 
Sip = st + — (42) 
Zz OX; Ox; 


where the w’s are the displacements along the three axis. In the case of a 
shearing strain occurring when i # j, the strain is only half that usually 
used in engineering practice. In order to avoid writing the factor 1/41, we 
use the variable 6,,.= D,,/4mr. Then, from (41), 


” ou’ a ou 
OS i; . Be 05,” Oo 


. au’ ‘ 
d6= - (43) 
Since, for most conditions of interest, adiabatic conditions prevail, we can 
set do equal to zero and can develop the dependent variables, the fields and 


‘“First and Second Order Equations for Piezoelectric Crystals Expressed in Tensor 
Form,” W. P. Mason, B.S.T.J., Vol. 26, pp. 80-138, Jan., 1947 
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the stresses in terms of the independent variables, the strains and the elec- 
tric displacements. Up to the second derivatives, these are 
4 ) OE m i 


S ; On 
aSy T 36, 


1 Em 29° Em 3° Em 
Ss ne Oe = Sah, + ee dd. t+ 
: 1 pee iSar + AS45965,” 05, 05, 


E 


af OT :-4 S., + IT x 


Tt ij 
OS i; : 


n 


06», 


1[ a Th 20° Tee Tut 
g SuiSar + = Sij5n + ——— bnd0 | + 
"2 Ee OSq Sj Dn 08, 05.” 


For the electro-optic and photoelastic cases, the two tensors of interest are 


a T; t ad ae U ’ x E,, 


ae = = — AtMrlno 
IS.£96n060  ISxtd5_ vi 


06, 065 
dE aU ee 

ae on angle a a aa = (49) Fmno- 
O0n O05 06m O6n O60 


(45) 


For the first partial derivatives, we have the values 


OT). OT. 0U OE 
—— = € set 5 = = ve oe = ere = —Nnre 
OS 5; O06» OS 06», OS pe 
OE : s 
- mn = Ar Binn 
00, 
I . ‘or . . 

where c;,,¢ are the elastic stiffnesses measured at constant electric displace- 
ment, /i,.¢ are the piezoelectric constants that relate the open circuit voltages 
to the strains, and 8;,,, are the impermeability constants measured for con- 
stant strain. 

With these substitutions and neglecting the other second partial deriva- 


tives, we have, from (44), 


. —— as ; ois 
Oe —hinii S35 + Dra E + Msimn Si; + D. | 4 


2 


) h., oo 
Tit = CintSi; + D. okt Mud dD, . 
4 2 ; 


This equation shows that there is a relation between the change in the im- 
permeability constant due to stress in the first equation, and the electro- 
strictive constant in the second equation through the tensor mj jm, . These 
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effects, however, have to be measured at the same frequency before equality 
exists. 

To obtain the changes in the optical properties caused by the strain and 
the electric displacement we have to determine the fields and displacements 
occurring at the high frequencies of optics. Even for piezoelectric vibrations 
occurring at as high frequencies as they can be driven by the piezoelectric 
effect, these frequencies are small compared to the optic frequencies f and 
can be considered to be static displacements or strains. Hence, writing 


. 0 , jt jut 
y —= Do + Buc € ; Dp, — iim +- D, 


D, = Dp. +- D9". A,’ = Si5 


ij 
where w = 2zf, the first of equation (47) can be written in the form 


0 . : 0 s . . Vmno 70 
Em = —lmigSij + Dn | Ban + MijmnSizs + — ~ Do 


Ss 
. jwt jot os - T'mno 0 IT mno 0 wt 
Ene’ = Dre Bows +- MNiimn¢ +> : D, + me 3  D. é . 
J 2 ) 


If we develop one of the fields, say £, , this can be written in the form 


’ jut + = - ( iw 
Exe — [Bu - Miu 4 riiDi + ri12D2 4 rii3D3\Die y 


+ [B12 + Mise Sij + riexDi + riveD2 + rizgD3]Dee* — (49) 
+ [B15 + M 5713 Si; 4 rin Dy + 1139 Ds + 1133 D3)D3e" 


where the first number of r refers to the field, the second to the optical value 
of D and the third to the static value of D. Hence, for the general case, 


. wt qwt ' ’ ' ( ~ 
Ene _ Dire [Bmn 5 Mejmn > ij a a (50) 


From the definition of the two tensors mjjno and Tmo given by equation 
(45), we can show that there are relations between the various components 
of the tensors. For the first tensor mj jn. , since S;; = Sj; is a symmetrical 
tensor, then 


Mijno = Mino (51) 


From the definition of the tensor m;;,. in the form 


F) au 
4tMijno = =a — 
as FY 06n 05 


it is obvious that we can interchange the order of 6, and 6, so that 


Mijno = Mijon 
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Since ij and no are reversible, it has been customary to abbreviate the tensor 
by writing one number in place of the two in the following form: 

11 = 1;22 = 2;33 = 3312 = 21 = 6;13 = 31 = 5;23 = 32=4 (52) 
Since the reduced tensor is associated with the engineering strains, it is 
necessary to investigate the numerical relationships between the four in- 
dex symbols and the two index symbols. From equation (48), when m 
~ n, the change in the impermeability constant Bn, is given by 

Mijmn Sig + Myimn Sji = Mrs Sr (53) 
Since S, = 2S;; = 2S;; we have the relation that 
Mijmn = Mrs(1,7, m,n = 1 to 3,7, 5, = 1 to 6) (54) 
In equation (45) we cannot in general interchange the order of 7 and no 
since LU’ does not contain product terms of strains and electric displace- 
ments and hence in general 
Mre F Mer. (55) 
Hence in the most general case there are 36 photoelastic constants. Crystal 
symmetrics cut down the number of constants as shown in a later section, 
The tensor fmno defined in equation (45) as 
3 77 
2 0 l - 
(4) 1! mno = ne o o (56) 
96m Ibn O80 
shows that we can interchange the order of m and u since U contains product 
terms of 6,, and 6, . Hence 
Tmno = Tnmo (57 ) 
and this is usually replaced by the two index symbols 
Too = Tmno(m, n, 0 = 1 to 3;g = 1 to 6). 
The so called “true” electro-optic constants are measured at constant 
strain and for this case the modifications in the impermeability constants 
are given by the equation 


Em = Dn [Ben + rnoDol - (58) 


Since m and w are interchangeable, the third rank tensor is usually replaced 


by the two index symbols 
tuno = Toolm, n,o = 1 to3;q = 1 to 6). (59) 


As discussed in the next sections, these constants can be determined by 
applying an electric field of a frequency high enough so that the principal 
resonances and their harmonics cannot be excited by the applied field, and 
measuring the resulting birefringence along definite directions in the crystal. 
On the other hand if we apply a static field to the crystal, an additional effect 
occurs because the crystal is strained by the piezoelectric effect and this 
causes a photoelastic effect in addition to the ‘true’ electro-optic effect. A 
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better designation for these effects is the electro-optic effect at constant 
strain and stress. 

This latter effect can be calculated from equation (47) by setting the 
stresses 7,¢ equal to zero and eliminating the S;; strains. After neglecting 
second order corrections, 


+ ) Ss Ss mM; mn h,, { 
Bix = Di ” a |, a + Tmno + : ‘) dD; . (60) 
4orc; ijk t 


+? D . . . 
Since Mont/ Ciné = Soij, the other piezoelectric constant relating the open 


circuit voltage to the stress, the electro-optic effect at constant stress can be 
written in the form 


Ss Mijmn Rois 


r . 
T'mno = mno + 4 (61 ) 
T 


In terms of the two index symbols 
T S Mpg Lop 


foo = Too + ~~ 


F . 4 ‘ . ‘ 
since it has been shown that goi; = gop/2 when? ¥ j, and the tensor in (61) 
has 17 as common symbols which involves the summations of two terms. 


(62) 


The electro-optic effect is usually measured in terms of an applied field. 
The change in the impermeability constant 8), for this case can be de- 
termined from the first equations (47), setting 7;¢ equal to zero and neglect- 
ing second order terms. Multiplying through by the tensor K¢, of the di- 
electric constants 

D, = Ev Ke» (63) 
since the product K,8o, = 1. ne this equation into (58) we have 


E. = * = Ds [Boum oz ull op ip Bol = Dn [Bmn +- ZS no Eel . (64) 


where the new tensor Zmno iS equal to 


8 s -T 
> —_ 
-mno ~~ Tuaplk op. 


In terms of the two index symbols 


S 8 -T 
Zq0 —_ Feplien . (60) 


in which the repeated index indicates a summation. The difference between 
the electro-optical constant at constant stress expressed in terms of the field 
and the electro-optical constant at constant strain is 


T _S ee Mijmn g 
4 


o —_ 
-mno ~~ «mno ! 


oi) , Ss ' ~ 
ql = Sano + Mijmn Opi (67) 


since the piezoelectric constants d,,; are related to the g constants by the 
equation 
-T 
i K 
dyij = eee ee (68) 
4 
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In terms of two index symbols 
T 8 
Zo = 20 + Mpqdop(p, g = 1 to 6; o = 1 to 3) (69) 
where a repeated index means a summation with respect to this index. 
Finally the photoelastic effect is sometimes expressed in terms of the 
stresses rather than the strains. As can be seen from equation (47), the new 
set of constants is 


D ~ 
Tq = MoprSrq (70) 


1 ° ° ° ° 
where the s,, are the elastic compliances measured at constant electric dis- 


placement. 


V. BrREFRINGENCE ALONG ANy DtREcTION IN THE CRYSTAL AND 
DETERMINATION OF THE ELECTRO-OPTIC AND 
PHOTOELASTIC CONSTANTS 

If we take axes along the Fresnel ellipsoid when no stress or field is ap- 
plied to the crystal, the result of the electro-optic and photoelastic effects 
is to change the impermeability constants by the values 


by = By + Ay ; Bo = Po + Ao ; B33 = B; + A; 
Boz = Aq; Bis = As; Bio = Ag 
where 
Ay = 2) L ZK: ot 213k3 1 mid -}- MoS 4 M1393 + myS4 
+ misSs5 + mioSe 
Ao = 228, + 2Zo9K2 + 203K MS) + Ma2S2 4- M353 + MoS 4 
+ maSs5 + moeSe 
As = 2318) + 222K 323 + mS) + m32S2 + msaS3 + mS 
+ M3S5 + M56 
Ag = 2yFy + 24.Ko + 243K3 + my Si + meSo + mazaS3 + masSs 
+ maySs + maioSe 
As = 2tt) + 25el%2 + 253823 +- merSi + mysoS2 + ms5353 + mssS4 
+ ms5S5 -+ mseSe 
As = 2eht) + 2e2H2 + 2633 + merS1 + meeSe + mezS3 + mesSs 
-+ MesS5 -f- Meo ¢ ° 
If we transmit light along the z’ axis which, as shown by Fig. 2, makes an 
angle of @ degrees with the z axis in a plane making an angle ¢ with the xz 
plane, the birefringence can be calculated as follows: Keeping z’ fixed and: 
rotating the other two axes about z’ by varying the angle y, one light vector 
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° / ° ° / . ss 
will occur when 8); is a maximum and the other when 6; is a minimum. 
Using the transformation equations (31) and the direction cosines of (27), 
a ys e . 
we find that 61; is given by the equations 


sin 2 sin 2y cos 6 
) 


Bi1 = Bir| cos 6 cos g cos py — + sin’ ¢ sin’ y 


+ Bys[sin 2g cos 2 — sin’ 6 sin 29 cos’ + cos @ sin 2p cos 2¢] 
+ 6;[—sin 26 cos ¢ cos" y + sing sin @ sin 2p} 


cos 6 sin 2g sin 2p 2 . 2 
: + cos ¢ sin 


+ Bo | cos @ sin’ g cos y + 
+ B2;{—sin 26 sin gy cos y — sin 6 cos ¢ sin 2¥] + B33 sin 6 cos 


, 
311 . 
0, we find an ex- 


. . . . ° of 
Differentiating with respect to y and setting 9 
c 


pression for tan 2¥ in the form 


— 81, sin 2y cos 6 2812 cos 6 cos 2¢ 
Y 


ton 6 = + 2813 sin g sin 8 + B22 cos @ sin 29 — 28; sin 8 cos ¢ (74) 
Biilcos 9 cos g — sin ¢g] + Bii(1 + cos’ 4) sin 2¢] 
— Bi; sin’ O@ces gy + B22(cos 6 sin’ gy — cos ¢) 


— Bo; sin 20 sin g + B33 sin @ 


Inserting this value back in equation (73) we find that the two extreme values 
/ . . 
of 8, are given by the equation 
28' 11 = 2Bo» an (By, oa Bo») (cos? 6 cos? ~ + sin? ¢) : (833 - Boo) sin® 0 
— By sin? 6 sin 29 — Biz sin 26 cose — Be; sin 24 sin ¢ 
(B11 — B22)?(cos? @ cos? g + sin? g)* + 2(81 — B22)(833 — B22) sin? OX 
(cos? 6 cos* g — sin* ¢) -fe (B33 — Boe)* sin 0 — 2(81 — Boy) X 
[3:2(sin 2g sin? @(cos? @ cos? g + sin® ¢) + Bi; sin 26 cos ¢X 


(cos? 86 cos @ + sin’g) — Bo; sin 26 sin g(1 + cos* ¢ sin* 4)| 


+ 2(83; — Be) sin? [8 sin 2¢(1 + cos? @) — B13 sin 26 cos ¢ 
(75) 


— Bo; sin 26 sin g| + (28;2)*{sin* @ sin’? ¢ cos? ¢ + cos? 8] 
— 48,0813 sin? @ sin g{cos” 6 cos* g + sin? ¢| — 4(By» Bos) 
[sin 20 cos g(sin? ¢ cos? + cos? g)} + (2813)* sin? 6X 
(cos? 8 cos? g + sin? ¢) — 4813823 sin 2y sin‘ 6 


+ (28»3)? sin? (cos? @ sin? g + cos? ¢) 
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The birefringence in any direction can be calculated from equation (75); 
since Bit = 0; V’, it equals 1 m where y; is the index of refraction corre- 
sponding to a light wave with its electric displacement in the 8’); direction. 
Similarly, for the second solution at right angle to the first, 


(76) 
Hence if we designate the expression under the radical by Ke and half the 


expression on the right outside the radical by K;, we have 


1 1 1 
wee _ Se “~ — s = V/Ko - (77) 


9 


—? 
Mi Me By ‘ 
Since 4; and ye are very nearly equal even in the most birefringent crystal, 
we have nearly 


3 
de -~ wm = B= . VK2. (78) 


For special directions in the crystal, the expression for Ky simplifies very 
considerably. Along the x, y and z axes, the values are 


3 
= 90°); Bs = 5 V(Bss — Biz)? + (282s)? 


a= OO"): = > V/ (Bu a B33)" + (2813)? 


3 


Z, (g = 0°,6 = 0°); B, = a Vv (Bu - Boe)? + (262)?. 


~ 


If any natural birefringence exists along these axes, (26;)? will be 
small compared to this and 


1 
3 + A; — a.) 
Mb 


1 
“3 7+ 4&1 - s,) (80) 


€ 


’ 4 1 
(8, - Be + A, — Ae) = —~ ata as), 
Mb 


Hence, for this case, measurements along the three axes will tell the differ- 
ence between the three effects A, , Ao and A; . To get absolute values requires 
a direct measurement of the index of refraction along one of the axes and 
its change with fields or stresses. This is a considerably more difficult meas- 
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urement than a birefringence measurement and requires the use of an ac- 
curate interferometer. 

If, however, the Z axis is an optic axis as it is in ADP, for example, and 
A, = A, = 0, a birefringence occurs due to the term 8». As shown in the 
next section, the electro-optic constants for ADP (tetragonal 42m) are 24; 
and 23 . 263 occurs in the expression for By = Ag , as can be seen from equa- 
tions (72), and hence the birefringence along the Z axis is 

3 
Bb, = 5 #2812 = Ma 263 E3. (81) 
The constants 23 and z4, have been measured independently by W. L. Bond, 
Robert O’B. Carpenter, and Hans Jaffe. Probably the most accurate meas- 
urements, and the only one published, are those of Carpenter,’ who finds 
that the indices of refraction and the 23 and z_ constants for ADP and 
KDP are in cgs units 


Ma Me 763107 Ta.x07 
ADP 1.5254 1.4798 2.54 + 0.05 6.25 + 0.1 
KDP 1.5100 1.4684 3.15 + 0.07 2.58 + 0.05 


An even larger constant has been found for heavy hydrogen KDP by Zwicker 
and Scherrer.® They find at 20°C that r6; = 6 X 10-7. Using this constant, a 
half wave retardation for a \ = 5461 A®° mercury line occurs for a voltage 
of 4000 volts. 

For tetragonal crystals of these types the only photoeleastic constant for 
the z axis is mee, and the birefringence for this case is given by 


3 . o 
B, - MaMesS¢ (82) 


When a natural birefringence exists for the crystal, measurements of the 
other three effects Ay, A; and Ag can be made by determining the bire- 
fringence along other directions than the Fresnel ellipsoid axes. In a direction 
of Z’ lying in the XYZ plane g = 0, @= variable and 


3 


ae mM /\(Bu — Bo) cos? 6 + (833 — B22) sin? 6 — Bis sin? 6}? 
= V + [2812 cos @ -+ 2803 sin 6). 


When a natural birefringence exists, this reduces to 


uw 1 1 2 
By = 3 — 3+ A; — Ae]cos 6 
” 24 \u Mb 
1 1 e 
+{—+> — 3+ A; — A.])sin 6 — A; sin 26 
Me Mb 


5 “The Electro-optic Effect in Uniaxial Crystals of the Type XH.PO, ,” Robert O’B 
Carpenter, Jour. Opt. Soc. Am., in course of publication. 
6 Zwicker and Scherrer, Helv. Phys. Acta., 17, 346 (1944). 


(83) 


(84) 
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and hence, by measuring at 45° between the two axes, one can evaluate the 
A; term. 
Similarly, for the YZ plane, g = 90°, 6= variable and 
3 / "7 > * 9 e % 
wh , /[—(Bu — 822) + (833 — B22) sin? @ — 8:3 sin 26]? 
2 VV + [28,2 cos 6 — 2643 sin 6]. 


Hence, when a natural birefringence exists, we have 
1 
ee et 
vrs 


1 1 2 
7 G “2 ae 4.) sin @ — Ay sin 20]. 
Me Mb 


In the YY plane @= 90°, g = variable and 


vs JAl(Bi — Bis) sin? og — (B33 — re pe gm teP 
/ + [2613 sin ¢ — B23 cos ¢]*. 


Then, for natural birefringence, 


1 ' 
Ts A 1 . s,) sin 7] 
b 


1 1 
= (= --3+ A, - a.) — Ag sin 26, 
Me = Bb 


Hence, with measurements at 45° between the axes and with suitably ap- 
plied fields and strains, the three effects Ay, 4; and Ag can be measured. 
Since the axes of the test specimen are turned with respect to the XY, Y¥ and 
Z axes, suitable transformations of the effects A; to Ae with respect to the 
new axes will have to be made. These can be done as shown in reference (4) 
by means of tensor transformation formulae. 

Another method for measuring the constants in Ay, As , Ag is to measure 
the amount they rotate the axes of the Fresnel ellipsoid. As an example con- 
sider the 2, constant of ADP. For example, if we look along the X axis and 
apply a field in the same direction, then, in equation (74), @= 90°, ¢ = O and 


- 2Bo23 - it 224) FE 7 — 2b us 241 E, 
833 — Bre 1 1 (up + Me) (uo = Me) 


tan ly (89) 


Me Mp 
According to Carpenter, the 24; electro-optic constant of ADP is 6.25 & 1077 


in cgs units. wa = wo = 1.5254; wu, = 1.4798; hence the angle of rotation for 
a field of 30,000 volts per centimeter = 100 stat volts cm is 


V -2.25 & 10°* radians = 7.7 minutes of arc. (90) 
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VI. ELecrro-opric AND PHOTOELASTIC TENSORS FOR VARIOUS 
CRYSTAL CLASSES 


SINCE Tmno = Tomo ANA Zmno = Znmo are third rank tensors similar to the 
hm ij piezoelectric tensor, they will have the same components for the various 
crystal classes. For the twenty crystal classes that show the electro-optic 
effect these tensors are given below. They are given with the crystal system 
they belong to, and the symmetry is designated by the Hermann-Mauguin 


symbol. The last number of the subscript of s designates the direction of the 


applied static field. 
(91) 


Triclinic; 1 


Monoclinic; 2 


Monoclinic; 2 = m 


Orthorhombic; 222 


Orthorhomic; 2mm 


Tetragonal; 4 
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Tetragonal; 4 0 251 


0 


Tetragonal; 42m 


Tetragonal; 422 


Tetragonal; 4mm 


Trigonal; 3 


Trigonal; 32 


Trigonal; 3m 


Hexagonal; 6 











OPTICAL PROPERTIES IN CRYSTALS 


Hexagonal; 6m2 


Hexagonal; 6 


Hexagonal; 622 


Hexagonal; 6mm 


Cubic; 23 and 43m | 
| 0 
0 


The r tensor has similar terms. 

The photoelastic constants are similar to the elastic constant tensors 
except that m,, # m,, in general. However, for the tetragonal, trigonal, 
hexagonal and cubic systems, Pockels found that my. = m2, . This follows 
from the transformation equations about the Z axis which is the » fold 
axes for these groups. For a rotation of an angle @ about Z, the direction 
cosines are 
- , . / - , 
vA Ox, er as px = po 

Ox, ) OX3 


/ / 
ra] Xe ° OXe 
ts — = —sin@d my, : ; 

Ox, Xe OX3 


a. iia 
OX3 OX3 


Ox, OX 
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rT ° / / / / 
rransforming the two terms m2 = my. and mo = mo by the tensor 


transformation equation 


ag RS Te A. . 
OX; OX; OX, OXE 


Mijkl Whicnice (93) 


OXm OXn OX OXp 
we find, for these two coefficients, 
/ ; 
My. = (my + Mo — Ames) sin? 6 cos? 8 + 2(me — mys) 


° 3 e 3 4 + 4 
sin 8 cos: 8 + 2(me, — mig) sin’ 6 cos 6 + my cos 6 + my sin Oo - 
(94) 
= (my, + moo — 4meg) sin? 6 cos? 6 + 2(mys — mee) 


. 3 4 <i 
sin’ 6 cos 0 + 2(mog — me) sin 6 cos 8 + me cos 6 + my sin 6 
© / / e ° . 
If my. = mo for all angles of rotation we must have 
Mig + Mog = Me, + Meo 


For all the classes that my = mz, either mg = —myg and me = — me Or 
else mig = Mog = Me = Me = 0. 

Now, if Z is a four-fold axis, as it is in the tetragonal and cubic systems, 
then, for a 90° rotation, the value of mj: or ma must repeat. From the first 
of (92) this means that 

My = Me, and my = My 


For a trigonal or hexagonal system additional relations are obtained between 
Mes ANC My, , Mo aNd my in the usual manner. Hence the photoelastic matrices 
become, for the various crystal classes, 
(95) 
Triclinic 36 12 ; ” 15 The w ten- 
Constant sor is en- 
tirely anal- 
ogous 


Monoclinic 12 ' ; The 7 ten- 

20 Con sor is en- 

stants ta 22 ca % tirely anal 
ogous 
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Ortho- Mis 
rhombic 12 
Constants Meo 


M32 


Tetragonal 
4,4,4/m 9 
Constants 


me, 


Tetragonal mit Myo 
42m, 422 

4mm, Myi2 Mi 
(4/m)mm 

7 Constants 


Trigonal mi 
3,3 11 Con 
stants 


Myo 
Ms) 
m4 


Meso 


Trigonal 
32,3m 
3(2/m) 8 
Constants 


m3 0 
Mag 0 


33 


Mis 
my3 
33 
0 
0 


Mis 
Mis 
Meas 


0 


my 
my 


Meas 


185 


The x ten 
sor is en 
tirely anal- 


ogous 


The w ten 
sor is en 
tirely anal 
ogous 


The w ten 
sor is en 
tirely anal 
ogous 


The w ten 
sor ts anal 
ogous ex 
cept that 
we = 22 
6 = ley 
™ = 

(wh 
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Hexagonal my, M2 Mz The zw ten- 

6,6m2.6 sor is anal- 

622 ,6/m; m2 My = M13 ogous ex- 
cept that 

rs = 

6 Constants wu — Fiz 


6 
Omm,— mm M31 M3, M33 
m 


Mi,— M2 


Cubic Sys The zw ten- 
tem 23,432 sor is en- 
2 a2 tirely anal 
3,43m,—3 ess 
m mm ogous 
? +e 
3 Constants (95) 


Isotropic Miz The m ten- 
Systems 2 sor is anal- 
Constants mi2 ogous ex- 
cept that 
re = 
Mi, — Mie mu — 2 
2 


mi 
0 


Mi — M2 
? 


My, ~~ My 
9 


From measurement’ on the photoelastic effects at high pressure for cubic 
crystals, it has become apparent that the second derivatives of equation 
(44) are not sufficient to represent the experimental results and derivatives 
up to the fourth power should be included. This extension, however, is not 
considered in the present paper. 


VII. PHOTOELASTICITY IN IsorROPIC MEDIA 


The photoelastic effect in isotrepic solids has been used extensively in 
studying the stresses existing in machine parts and other pieces. For this 
purpose a plastic model cut in the shape of the original is used and is loaded 
in a similar manner to that of the machine part to be studied. Since stresses 
are applied, the 7; photoelastic constants are most useful. If we look along 


7H. B. Maris, Jour. Optical Society of Amer., Vol. 15, pp. 194-200, 1927. 
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the Z axis, the last of equations (79) shows that the birefringence is equal 
to 


3 
B= 5 V (Bi + Ar — B2 — Ae)? + 4(As)? (96) 


Since, for an isotropic substance 8; = 62, we have, after substituting the 
value of 4, and A; , with the appropriate photoelastic constants from equa- 
tion (95), (last tensor): 


3 

rm 7-1" Say hin 

B, = > (mu — mV (T; — T,)? + 4T? (97) 

If we transform to axes rotated by an angle @ about Z, the values of 7); 
and are given by 


To cos 707; + 2 sin 6 cos 67, + sin? 67, 


v) _/ . ° a) ° Ue el * . 
To. = sin *67, — 2 sin 6 cos 67, + cos OT» 
_/ . . - 
If, now, we choose the angle @ so that 7); is a maximum, we find 


_ +27, 
tan 20 = TT, 
Inserting this value of tan 26 in (98) we find 
_ Tit Ts 
2 
Ti + T: 
2 





+ 4V(T; — 72)? + 47? 


— 4V(T, — Ti)? + AT? 
and, hence, 
Ti -— T: = V(T, — Try +47 ¢ (101) 

Hence the birefringence obtained in stressing a material is proportional to 
the difference in the principal stresses. By observing the isoclinic lines of a 
photoelastic picture, methods® are available for determining the stresses 
in a model. A photograph® of a stressed disk is shown by Fig. 3. The high 
concentration of lines near the surface shows that the shearing stress is 
very high at these points. By counting the number of lines from the edge 
and knowing the stress optical constant, the stress can be calculated at any 
point. 

If we apply a single stress 7, , the birefringence is given by the equation 


B, = : (mir — Ww)T) (102) 


8 See Photoelasticity, Coker and Filon, Cambridge University Press, 1931. 
* This photograph was taken by T. F. Osmer. 
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Instead of using the constants m, and my it is customary to use a single 
constant C given by 

B= m4—- wo =r = CT (103) 


where the constant C is called the relative stress optical constant and r the 
retardation. The dimensions of C are the reciprocal of a stress and are 


? 


Fig. 3—Photoelastic picture of a disk in compression. 


measured in cm? per dyne. A convenient unit for most purposes is one of 
10° cm? dyne; if this is used, the stress optical coefficients of most glasses 
are from 1 to 10 and most plastics are from 10 to 100, This unit so defined 
has been called the ‘‘Brewster”’. In terms of the Brewster, the retardation is 


r = CTd (104) 


If C is measured in Brewsters, d in millimeters and 7 in bars (10° dynes 


cm*) then r, as given by the formula, is expressed in angstrom units, 
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[SECOND INSTALLMENT] 


CHAPTER IV 
FILTER-TYPE CIRCUITS 
SYNOPSIS OF CHAPTER 


SIDE FROM HELICES, the circuits most commonly used in traveling 
wave tubes are iterated or filter-type circuits, composed of linear 
arrays of coupled resonant slots or cavities. 

Sometimes the geometry of such structures is simple enough so that an 
approximate field solution can be obtained. In other cases, the behavior of 
the circuits can be inferred by considering the behavior of lumped-circuit 
analogues, and the behavior of the circuits with frequency can be expressed 
with varying degrees of approximation in terms of parameters which can be 
computed or experimentally evaluated. 

In this chapter the field approach will be illustrated for some very simple 
circuits, and examples of lumped-circuit analogues of other circuits will be 
given. The intent is to present methods of analyzing circuits rather than 
particular numerical results, for there are so many possible configurations 
that a comprehensive treatment would constitute a book in itself. 

Readers interested in a wider and more exact treatment of field solutions 
are referred to the literature.'? 

The circuit of Fig. 4.1 is one which can be treated by field methods. This 
“corrugated waveguide” type of circuit was first brought to the writer’s 
attention by C. C. Cutler. It is composed of a series of parallel equally spaced 
thin fins of height / projecting normal to a conducting plane. The case treated 
is that of propagation of a transverse magnetic wave, the magnetic field 
being parallel to the length of the fins. It is assumed that the spacing ¢ is 
small compared with a wavelength. In Fig. 4.2, 8h is plotted vs. 8o4. Here 8 
is the phase constant and 8) = w/c is a phase constant corresponding to the 
velocity of light. 

‘FE, L. Chu and W. W. Hansen, “The Theory of Disk-Loaded Wave Guides,” Journal 
of Applied Physics, Vol. 18, pp. 999-1008, Nov. 1947 


2 L. Brillouin, “Wave Guides for Slow Waves,” Journal of Applied Physics, Vol. 19 
pp. 1023-1041, Nov. 1948. 
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For small values of 8o/, that is, at low frequencies, very nearly 8 = Bo; 
that is, the phase velocity is very near to the velocity of light. The field 
decays slowly away from the circuit. The longitudinal electric field is small 
compared with the transverse electric field. In fact, as the frequency ap- 
proaches zero, the wave approaches a transverse electromagnetic wave 
traveling with the speed of light. 

At high frequencies the wave falls off rapidly away from the circuit, and 
the transverse and longitudinal components of electric field are almost equal. 
The wave travels very slowly. As the wavelength gets so short that the 
spacing ¢ approaches a half wavelength (@¢ = 7) the simple analysis given 
is no longer valid. Actually, 8 = m specifies a cutoff frequency; the circuit 
behaves as a lowpass filter. 

Figure 4.3 shows two opposed sets of fins such as those of Fig. 4.1. Such 


a circuit propagates two modes, a transverse mode for which the longi- 


tudinal electric field is zero at the plane of symmetry and a longitudinal 
mode for which the transverse electric field is zero at the plane of symmetry. 

At low frequencies, the longitudinal mode corresponds to the wave on a 
loaded transmission line. The fins increase the capacitance between the con- 
ducting planes to which they are attached but they do not decrease the 
inductance. Figure 4.6 shows 8/ vs. Bo/ for several ratios of fin height, /, 
to half-separation, d. The greater is //d, the slower is the wave (the larger 
is B/Bo). 

The longitudinal mode is like a transverse magnetic waveguide mode; it 
propagates only at frequencies above a cutoff frequency, which increases 
as h/d is increased. Figure 4.7 shows Bh vs. Bot = (w/c)h for several values 
of h/d. The cutoff, for which 8¢ = z, occurs for a value of Bo/ less than 2/2. 
Thus, we see that the longitudinal mode has a band pass characteristic. The 
behavior of the longitudinal mode is similar to that of a longitudinal mode of 
the washer-loaded waveguide shown in Fig. 4.8. The circuit of Fig. 4.8 has 
been proposed for use in traveling-wave tubes. 

The transverse mode of the circuit of Fig. 4.3 can also exist in a circuit 
consisting of strips such as those of Fig. 4.1 and an opposed conducting 
plane, as shown in Fig. 4.5. This circuit is analogous in behavior to the disk- 
on-rod circuit of Fig. 4.9. The circuit of Fig. 4.5 may be thought of as a 
loaded parallel strip line. That of Fig. 4.9 may be thought of as a loaded 
coaxial line. 

Wave-analysis makes it possible to evaluate fairly accurately the trans- 
mission properties of a few simple structures. However, iterated or repeating 
structures have certain properties in common: the properties of filter 
networks. 

For instance, a mode of propagation of the loaded waveguide of Fig. 4.10 
or of the series of coupled resonators of Fig. 4.11 can be represented ac- 
curately at a single frequency by the ladder networks of Fig. 4.12. Further, 





FILTER-TYPE CIRCUITS 191 


if suitable lumped-admittance networks are used to represent the admit- 
tances B, and Be, the frequency-dependent behavior of the structures of 
Figs. 4.10 and 4.11 can be approximated. 

It is, for instance, convenient to represent the shunt admittances By, and 
the series admittances B, in terms of a “longitudinal” admittance B, and 
a “transverse” admittance B; . B, and By are admittances of shunt resonant 
circuits, as shown in Fig. 4.15, where their relation to B, and By and ap- 


proximate expressions for their frequency dependence are given. The res- 
onant frequencies of B, and By, that is, w, and wr, have simple physical 


meanings. Thus, in Fig. 4.10, w, is the frequency corresponding to equal 
and opposite voltages across successive slots, that is, the r mode frequency. 
wr is the frequency corresponding to zero slot voltage and no phase change 
along the filter, that is, the zero mode frequency. 

If w, is greater than w,, the phase characteristic of this lumped-circuit 
analogue is as shown in Fig. 4.17. The phase shift is zero at the lower cutoff 
frequency wy and rises to w at the upper cutoff frequency w, . If wr is greater 
than w, , the phase shift starts at —z at the lower cutoff frequency w, and 
rises to zero at the upper cutoff frequency wr, as shown in Fig. 4.19. In this 
case the phase velocity is negative. Figure 4.20 shows a measure of (#/8°P) 
plotted vs. w for w,;, > wr. This impedance parameter is zero at wr and rises 
to infinity at w,. 

The structure of Fig. 4.11 can be given a lumped-circuit equivalent in a 
similar manner. In this case the representation should be quite accurate. 
We find that w, is always greater than wr and that one universal phase curve, 
shown in Fig. 4.27, applies. A curve giving a measure of (£°/6?P) vs. fre- 
quency is shown in Fig. 4.28. In this case the impedance parameter goes to 
infinity at both cutoff frequencies. 

The electric field associated with iterated structures does not vary sinus- 
oidally with distance but it can be analyzed into sinusoidal components. 
The electron stream will interact strongly with the circuit only if the elec- 
tron velocity is nearly equal to the phase velocity of one of these field com- 
ponents. If @ is the phase shift per section and ZL is the section length, the 
phase constant 8,, of a typical component is 


Bm = (0+ 2mr)/L 
where m is a positive or negative integer. The field component for which 
m = (is called the fundamental; for other values of m the components are 
called spatial harmonics. Some of these components have negative phase 
velocities and some have positive phase velocities. 
The peak field strength of any field component may be expressed 
E = —M(V/L) 
Here V is the peak gap voltage, L is the section spacing and M isa function 
of 6 (or 8,,) and of various dimensions. For the electrode systems of Figs. 
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4.29, 4.30, 4.31 and 4.32 M is given by (4.69), (4.71), (4.72) and (4.73), 
respectively. 

The factor M may be indifferently regarded as a factor by which we 
multiply the a-c beam current to give the induced current at the gap, or, 
as a factor by which we multiply the gap voltage in obtaining the field. We 
can go further, evaluate £°/s°P in terms of gap voltage, and use M*Jo as the 
effective current, or we can use the current /» and take the effective field in 
the impedance parameter as 


FE? = M*(V/f¢)? 


It is sometimes desirable to make use of a spatial harmonic (m # 0) 
instead of a fundamental, usually to (1) allow a greater resonator spacing 
(2) to obtain a positive phase velocity when the fundamental has a negative 
phase velocity (3) to obtain a phase curve for which the phase angle is 
nearly a constant times frequency; that is, a phase curve for which the group 
velocity does not change much with frequency and hence can be matched 
by the electron velocity over a considerable frequency range. Figure 4.33 
shows how 6 + 27 (the phase shift per section for m = 1) can be nearly a 
constant times w even when @ is not. 


4) ree 


Fig. 4.1—A corrugated or finned circuit with filter-like properties. 


: : ' 
4.1 Fretp SOLUTIONS 


An approximate field analysis will be made for two very simple two- 
dimensional structures. The first of these, which is shown in Fig. 4.1, is 
empty space for y > 1 and consists of very thin conducting partitions in the 
y direction from y = 0 to y = —h; the partitions are connected together 
by a conductor in the z direction at y = —/. These conducting partitions 
are spaced a distance ¢ apart in the z direction. The structure is assumed to 
extend infinitely in the +. and —.x directions. 

In our analysis we will initially assume that the wavelength of the propa- 
gated wave is long compared with ¢. In this case, the effect of the partitions 
is to prevent the existence of any y component of electric field below the z 
axis, and the conductor at y = —/ makes the z component of electric field 
zero at y = —Z. 

In some perfectly conducting structures the waves propagated are either 
transverse electric (no electric field component in the direction of propaga- 
tion, that is, z direction) or transverse magnetic (no magnetic field com- 
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ponent in the z direction). We find that for the structure under consideration 
there is a transverse magnetic solution. We can take it either on the basis 
of other experience or as a result of having solved the problem that the 
correct form for the « component of magnetic field for y > 0 is 

HH, = ie" ™ (4.1) 


Expressing the electric field in terms of the curl of the magnetic field, we have 


OHz OdHy 


¢ dy ra] a 


jwels, 


OHx dHz 


weE, = . 
; ‘ dz Ox 


We can in turn express //, in terms of £, and EF, 


Okz OER y 


soe jou, sa F 
2 oy Of 


This leads to the relation 

(4.7) 
Now, 1/V ue is the velocity of light, and w divided by the velocity of light 
has been called @y , so that 

(4.8) 


Between the partitions, the field does not vary in the z direction. In any 
space between from y = 0 to y = —h, the appropriate form for the magneti: 
field is 
cos Boly + fh) 


H, = Ho 


COS Bok 


From this we obtain by means of (4.4) 


in ee 


We Cos Bok 


sin Boly + h) 


(4.10) 


Application of (4.6) shows that this is correct. 
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Now, at y = 0 we have just above the boundary 


R= «~jitpe™ (4.11) 


—J © 
WE 


The fields in the particular slot just below the boundary will be in phase 
with these (we specify this by adding a factor exp —j8z to 4.10) and hence 
will be 

_ Bo 


WE 


EK, = Hye *™ tan Boh (4.12) 


From (4.11) and (4.12) we see that we must have 


Boh tan Boh = yh (4.13) 






































Le 
LL ——e 


0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 
Boh 
Fig. 4.2—The approximate variation of the phase constant 8 with frequency (propor- 
tional to Bo/) for the circuit of Fig. 4.1. The curve is in error as 8€ approaches w, and there 
is a cutoff at Bf = x, 





Using (4.8), we obtain 
Pan (4.14) 
COS Boh 
In Fig. 4.2, 8 has been plotted vs 8)/, which is, of course, proportional to 
frequency. This curve starts out as a straight line, 8 = po ; that is, for low 
frequencies the speed is the speed of light. At low frequencies the field falls 
off slowly in the y direction, and as the frequency approaches zero we have 
essentially a plane electromagnetic wave. At higher frequencies, 8 > 8», 
that is, the wave travels with less than the speed of light, and the field falls 
off rapidly in the y direction. According to (4.14), 6 goes to infinity 
at Boh = w/2. 
As a matter of fact, the match between the fields assumed above and below 
the boundary becomes increasingly bad as 3¢ becomes larger. The most rapid 
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alteration we can have below the boundary is one in which fields in alternate 
spaces follow a +, —, +, — pattern. Thus, the rapid variations of field above 
the boundary predicted by (4.14) for values of 894 which make Sf greater 
than w cannot be matched below the boundary. The frequency at which 
Bf = x constitutes the cutoff frequency of the structure regarded as a filter. 
There is another pass band in the region r < 89 < 3/2, in which the ratio 
of E to H below the boundary has the same sign as the ratio of E to H above 
the boundary. 

A more elaborate matching of fields would show that our expression is 
considerably in error near cutoff. This matter will not be pursued here; the 
behavior of filters near cutoff will be considered in connection with lumped 
circuit representations. 

We can obtain the complex power flow P by integrating the Poynting 
vector over a plane normal to the z direction in the region y > 0. Let us 
consider the power flow over a depth W normal to the plane of the paper. 
Then 


2 Ww 
“= l (E,Hy — E,H?) dx dy (4.15) 
+ #0 0 


Using (4.1) and (4.3), we obtain 


P= a BH ec dy 
2 Jo (we " 


(4.16) 


p 1 Haw 
4 wey 


We will express this in terms of E the magnitude of the z component of 


the field at y = 0, which, according to (4.5), is 


Eau 2.8, (4.17) 


WE 
We will also note that 
Et = wV ue Vv ‘u ‘€ 
= (w/c)/Vp/e = Bo/V p/e 
and that 
V u/e = 377 ohms (4.19) 
By using (4.17)-(4.18) in connection with (4.16), we obtain 
E’/8°P = (4/BoW)(y/8)’ Vu/e (4.20) 


We notice that this impedance is very small for low frequencies, at which 
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the velocity of the wave is high, and the field extends far in the y direction 
and becomes higher at high frequencies, where the velocity is low and the 
field falls off rapidly. 

We will next consider a symmetrical array of two opposed sets of slots 
(Fig. 4.3) similar to that shown in Fig. 4.1. Two modes of propagation will 
be of interest. In one the field is symmetrical about the axis of physical 
symmetry, and in the other the fields at positions of physical symmetry are 
equal and opposite. 

In writing the equations, we need consider only half of the circuit. It is 
convenient to take the z axis along the boundary, as shown in Fig. 4.4. 


777 /, 7) 






































4WAUA J 





Fig. 4.3—A double finned structure which will support a transverse mode (no —_ 
tudinal electric field on axis) and a longitudinal mode (no transverse electric field on axis) 


Ye)". 


Fig. 4.4--The coordinates used in connection with the circuit of Fig. 4.3. 





This puts the axis of symmetry at y = +d, and the slots extend from y = 0 
to y = h. 

For negative values of y, (4.9), (4.10), (4.12) hold. 

Let us first consider the case in which the fields above are opposite to the 
fields below. This also corresponds to waves in a series of slots opposite a con- 
ducting plane, as shown in Fig. 4.5. In this case the appropriate form of the 


magnetic field above the boundary is 


‘OS (¢ — ) F 
H. = H, ove — 9) (4.21) 


cosh yd 
From Maxwell’s equations we then find 


B H, Cosh y(d — y) 


we cosh yd 
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Biwced sinh y(d — y) ? 
Bo = 
At y = 0 we have from (4.23) and (4.12) 


E, = -—j Y Hoe™™ tanh yd 


: we 


° Bo 


WE 


Hye ™ tan Boh 


Hence, we must have 


yh tanh (d/h)yh) = Boh tan Boh (4.26) 


CONOYCT/ING 
PLANE 





Fig. 4.5—-The transverse mode of the circuit of Fig. 4.3 exists in this circuit also. 
Here we have added parameter, (d/h). For any value of @ h, we can obtain 
yh vs Boh; and we can obtain Bh in terms of yi by means of 4.24 

2 f 2\1/2 ~~ 
Bh = ((yh)” + (Boh)°) (4.27) 
We see that for small values of 8) (low frequencies) 


+? = (h/d) Bi (4.28) 


} 


+ 4 (4.20) 
d 


If we examine Fig. 4.5, to which this applies, we find (4.28) easy to explain. 
At low frequencies, the magnetic field is essentially constant from y d 


to y = —A, and hence the inductance is proportional to the height # + d. 


The electric field will, however, extend only from y = 0 to y = d; hence 
the capacitance is proportional to 1/d. The phase constant is proportional 
to WLC, and hence (4.29). At higher frequencies the electric and magneti 
fields vary with y and (4.29) does not hold. 

We see that (4.26) predicts infinite values of y for Bh m2. As in the 


previous cases, cutoff occurs at 8f = . 
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As an example of the phase characteristic of the circuit, 84 from (4.26) 
and (4.27) is plotted vs 89h for h/d = 0, 10, 100 in Fig. 4.6. The curve for 
h/d = 0 is of course the same as Fig. 4.2. 

If we integrate Poynting’s vector from y = 0 to y = d and for a distance 
W in the x direction, and multiply by 2 to take the power flow in the other 
half of the circuit into account, we obtain 


i St ai al sinh’ yd 
*/g° P = (2/8 W)(y/6) cccnell oS ) Sule (4.30 
nive (2/Bo W)(y/B) (= yd cosh yd + yd Vu/e 


eae 


7h tanh (2) yh=Aoh tanBoh 








ne 0, 10, 100 maaan 2 


Bh = \(7h)* + (Goh)? 
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Fig. 4.6—The variation of 8 with frequency (proportional to Bo) for the transverse 
mode of the circuit of Fig. 4.3. Again, the curves are in error near the cutoff at Bf = r. 


At very low frequencies, at which (4.28) and (4.29) hold, we have 
E*/8° P = (y*/Bo8°)(d/W) Vu/e 


‘ar : ‘ (4.31) 
E*/p° P = (h/d)'" (1 + d/h)” (d/W) Vu/e 


At high frequencies, for which yd is large, (4.30) approaches } of the value 


given by (4.20). There is twice as much power because there are two halves 
to the circuit. 

Let us now consider the case in which the field is symmetrical and E, does 
not go to zero on the axis. In this case the appropriate field for y > 0 is 


— sinh y(d — y) —js, (4.32) 


sinh yd 
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Proceeding as before, we find 


yh 
. - = Boh tan Bok 
tanh ((d/h) yh) 
We see that, in this case, for small values of yi we have 

Bok tanh Bok = h/d (4.33a) 
There is no transmission at all for frequencies below that specified by (4.33). 
As the frequency is increased above this lower cutoff frequency, y/ and 
hence 8h increase, and approach infinity at Bo = 2/2. Actually, of course, 
the upper cutoff occurs at Bf = x. In Fig. 4.7 Bh is plotted vs Boh for h/d = 0, 


24 





—T— = Ach tan@oh 
| tanh(#) 7h 


d =0,10,100 


Ah = V (rh)? + (Boh)? 
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Fig. 4.7—The variation of 8 with frequency (proportional to 8/4) for the longitudinal 
mode of the circuit of Fig. 4.3. This mode has a band pass characteristic; the band narrows 
as the opening of width 2d is made small compared with the fin height. Again, the curves 
are in error near the upper cutoff at 6 = x. 





ro) 


10, 100. This illustrates how the band is narrowed as the opening between 
the slots is decreased. 
By the means used before we obtain 


cosh* yd 
: (4.34) 
sinh yd cosh yd — —,) vere 


We see that this goes to infinity at yd = 0. For large values of yd it be- 
comes the same as (4.30). 


E*/8° P = (2/BoW)(y, a*( 


4.2 PRACTICAL CIRCUITS 


Circuits have been proposed or used in traveling-wave tubes which bear 
a close resemblance to those of Figs. 4.1, 4.3, 4.5 and which have very similar 
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properties’. Thus Field‘ describes an apertured disk structure (Fig. 4.8) 
which has band-pass properties very similar to the symmetrical mode of the 
circuit of Fig. 4.3. In this case there is no mode similar to the other mode, 
with equal and opposite fields in the two halves. Field also shows a disk-on- 
rod structure (Fig. 4.9) and describes a tube using it. This structure has low- 


Q } (070-70 70 0 0 0 10 0 
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Fig. 4.8—This loaded waveguide circuit has band-pass properties similar to those of 
4.7. 











Fig. 4.9—This disk-on-rod circuit has properties similar to those of Fig. 4.6. 
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Fig. 4.10—A circuit consisting of a ridged waveguide with transverse slots or resonators 
in the ridge. 
















































































pass properties very similar to those of the circuit of Fig. 4.5, which are 
illustrated in Fig. 4.6. 

Figure 4.10 shows a somewhat more complicated circuit. Here we have a 
rectangular waveguide, shown end on in a of Fig. 4.10, loaded by a longi- 
tudinal ridged portion R. In 6 of Fig. 4.10 we have a longitudinal cross sec- 

‘EF. B. Llewellyn, U. S. Patents 2,367,295 and 2,395,560. 


‘Lester M. Field, “Some Slow-Wave Structures for Traveling-Wave Tubes,” Proc. 
I.R.E., Vol. 37, pp. 34-40, Jan. 1949. 
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tion, showing regularly spaced slots S cut in the ridge R. The slots S may be 
thought of as resonators. 

Figure 4.11 shows in cross section a circuit made of a number of axially 
symmetrical reentrant resonators R, coupled by small holes H which act as 
inductive irises. 

It would be very difficult to apply Maxwell’s equations directly in de- 
ducing the performance of the structures shown in Figs. 4.10 and 4.11. 
Moreover, it is apparent that we can radically change the performance of 

















Fig. 4.11—A circuit consisting of a number of resonators inductively coupled by means 
of holes. 
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Fig. 4.12—-Ladder networks terminated in w (above) and 7 (below) half sections. Such 
networks can be used in analyzing the behavior of circuits such as those of Figs. 4.10 


and 4.11. 





such structures by minor physical alterations as, by changing the iris size, 
or by using resonant irises in the circuit of Fig. 4.11, for instance. 

As a matter of fact, it is not necessary to solve Maxwell’s equations afresh 
each time in order to understand the general properties of these and other 
circuits. 


4.3 Lumpep ITERATED ANALOGUES 


Consider the ladders of lossless admittances or susceptances shown in 
Fig. 4.12. Susceptances rather than reactances have been chosen because the 





202 BELL SYSTEM TECHNICAL JOURNAL 


elements we shall most often encounter are shunt resonant near the fre- 
quencies considered; their susceptance is near zero and changing slowly but 
their reactance is near infinity. 

If these ladders are continued endlessly to the right (or terminated in a 
reflectionless manner) and if a signal is impressed on the left-hand end, the 
voltages, currents and fields at corresponding points in successive sections 


will be in the ratio exp(-I’) so that we can write the voltages, 


Yu Fas (4.35) 


If the admittances VY; and V2 are pure susceptances (lossless reactors), I 
is either purely real (an exponential decay with distance) or purely imaginary 
(a pass band). In this case I’ is usually replaced by 78. In order to avoid 
confusion of notation, we will use 7@ instead, and write for the lossless case 
in the pass band 


V. = Voe™ (4.35a) 


Thus, 6 is the phase lag in radians in going from one section to the next. 
In terms of the susceptances,* 


cos 86 = 1+ B,/2B, (4.36) 


We will henceforward assume that all elements are lossless. 

Two characteristic impedances are associated with such iterated networks. 
If the network starts with a shunt susceptance B,/2, as in a of Fig. 4.12, then 
we see the mid-shunt characteristic impedance K, 


x. = 2(— Bo( Bz “f 4B,)) 1/2 
If the network starts with a series susceptance 2B; we see the mid-series 
characteristic impedance Kr 
Ky = +(1/2B,)(— Be. + 4B,)/By)'? (4.38) 
Here the sign is chosen to make the impedance positive in the pass band. 
When such networks are used as circuits for a traveling-wave tube, the 
voltage acting on the electron stream may be the voltage across By or the 
voltage across B, or the voltage across some capacitive element of By or 
B,. We will wish to relate this peak voltage V to the power flow P. If the 
voltage across By acts on the electron stream 
V?/P = 2K, (4.39) 
If the voltage across ); acts on the electron stream 
V = 1/7B, 


* The reader can work such relations out or look them up in a variety of books or hand- 
books. They are in Schelkunoff’s Electromagnetic Waves. 
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where 7 is the current in B, 


and hence 


Here the sign has been chosen so as to make V?/P positive in the pass band. 

Let us now consider as an example the structure of Fig. 4.10. We see that 
two sorts of resonance are possible. First, if all the slots are shorted, or if no 
voltage appears between them, we can have a resonance in which the field 
between the top of the ridge R and the top of the waveguide is constant 














Fig. 4.13—A ladder network broken up into 7 sections. 


all along the length, and corresponds to the cutoff frequency of the ridged 


waveguide. There are no longitudinal currents (or only small ones near the 
slots S) and hence there is no voltage across the slots and their admittance 
(the slot depth, for instance) does not affect the frequency of this resonance. 
Looking at Fig. 4.12, we see that this corresponds to a condition in which 
all shunt elements are open, or B, = 0. We will call the frequency of this 
resonance wr , the JT standing for transverse. 

There is another simple resonance possible; that in which the fields across 
successive slots are equal and opposite. Looking at Fig. 4.12, we see that 
this means that equal currents flow into each shunt element from the two 
series elements which are connected to it. We could, in fact, divide the net- 
work up into unconnected 7 sections, associating with each series element of 
susceptance B, half of the susceptance of a shunt element, that is, B2/2, 
at each end, as shown in Fig. 4.13, without affecting the frequency of this 
resonance. This resonance, then, occurs at the frequency w, (/ for longi 
tudinal) at which 


1 + By 4 = 0. (4.43) 


We have seen that the transverse resonant frequency, wr, has a clear 
meaning in connection with the structure of Fig. 4.10; it is (except for small 
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errors due to stray fields near the slots) the cutoff frequency of the wave- 
guide without slots. Does the longitudinal frequency w, have a simple 
meaning? 

Suppose we make a model of one section of the structure, as shown in 
Fig. 4.14. Comparing this with 6 of Fig. 4.10, we see that we have included 
the section of the ridged portion between two slots, and one half of a slot 
at each end, and closed the ends off with conducting plates C. The resonant 
frequency of this model is w, , the longitudinal resonant frequency defined 
above. 

We will thus liken the structure of Fig. 4.10 to the filter network of Fig. 
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Fig. 4.14——A section which will have a resonant frequency corresponding to that for 
radians phase shift per section in the circuit of Fig. 4.10. 


8, = 8, +52 tke 3 Br=Bo te, 3 


BL = 2C, (w-w,) By = 2Czr(w-wr) 


Fig. 4.15—The approximate variation with frequency (over a narrow band) of the 
longitudinal (B;) transverse (Br) susceptances of a filter network. 


4.12, and express the susceptances B, and By in terms of two susceptances 
By, and B, associated with the transverse and longitudinal resonances and 
defined below 


(4.44) 
ee Ve (4.45) 


At the transverse resonant frequency wr, By = 0, and at the longitudinal 
resonant frequency w,, B, = 0. So far, the lumped-circuit representation 
of the structure of Fig. 4.14 can be considered exact in the sense that at 
any frequency we can assign values to By and B, which will give the correct 
values for 6 and for V?, P for the voltage across either the shunt or the series 
elements (whichever we are interested in). 
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We will go further and assume that near resonances these values of B, 
and B, behave like the admittances of shunt resonant circuits, as indicated 
in Fig. 4.15. Certainly we are right by our definition in saying that Br = 0 
at wr, and B, = 0 at w,. We will assume near these frequencies a linear 
variation of By and B, with frequency, which is very nearly true for shunt 
resonant circuits near resonance* 


Br = 2C rlw — Wr) (4.46) 


Here Cy can mean twice the peak stored electric energy per section length 
for unit peak voltage between the top of the guide and the top of the ridge R 
when the structure resonates in the transverse mode, and C;, can mean twice 
the stored energy per section length Z for unit peak voltage across the top 


BAND 


oe 





Wr WL 


Fig. 4.16—Longitudinal and transverse susceptances which give zero radians phase 
shift at the lower cutoff (wo = wr) and @ radians phase shift at the upper cutofi (@ = wy) 


of the slot when the structure resonates in the longitudinal mode. 
In terms of By and B, , expression (4.36) for the phase angle @ becomes 
4B, + Br 
cos 6 = (4.48) 
4B, — by 
We see immediately that for real values of 6 (cos 6 < 1), By and B;, must 
have opposite signs, making the denominator greater than the numerator. 
Figure 4.16 shows one possible case, in which wr < w,. In this case the 
pass band (6 real) starts at the lower cutoff frequency w = wy at which B, 
is zero, cos 6 = 1 (from (4.48)) and 6 = 0, and extends up to the upper 
cutoff frequency w = w, at which B; = 0, cos 6 = —1 and 6 = f. 
* In case the filter has a large fractional bandwidth, it may be worth while to use the 
accurate lumped-circuit forms 
Br = wrC7(w/or — wr/w (4.46a) 


By = w1Cy(w/wy w/w (4.46b) 
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The shape of the phase curves will depend on the relative rates of varia- 
tion of By and B, with frequency. Assuming the linear variations with fre- 
quency of (4.46) and (4.47) the shapes can be computed. This has been done 
for C,/Cr = 1, 3, 10 and the results are shown in Fig. 4.17. 


77 

















O—> OL 
Fig. 4.17 —Phase shift per section, 6, vs radian frequency w for the conditions of Fig. 4.16. 





Fig. 4.18—Longitudinal and transverse susceptances which give —z radians phase 
shift at the lower cutoff (o = wz) and 0 degrees phase shift at the upper cutoff (@ = wr). 
This means a negative phase velocity. 


It is of course possible to make w; > wr. In this case the situation is as 
shown in Fig. 4.18, the pass band extending from w, to wr. At w = wz, 
cos 6 —1, 0 —r. Atw = wr, cos 6 = 1 and 6 = 0. In Fig. 4.19, as- 
suming (4.46) and (4.47), @ has been plotted vs w for C,/Cr = 1, 3, 10. 


The curves of Figs. 4.17 and 4.18 are not exact for any physical structure 
of the type shown in Fig. 4.10. In lumped circuit terms, they neglect coupling 
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between slots. They will be most accurate for structures with slots longitu- 
dinally far apart compared with the transverse dimensions, and least ac- 
curate for structures with slots close together. They do, however, form a 
valuable guide in understanding the performance of such structures and in 
evaluating the effect of the ratio of energies stored in the fields at the two cut- 
off frequencies. 
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Fig. 4.19—Phase shift per section, 0, vs radian frequency, w, for the conditions of Fig 
4.18. 


It is most likely that the voltages across the slots would be of most in” 
terest in connection with the circuit shown in Fig. 4.10. We can rewrite 
(4.41) in terms of By and B, 


1 


| 7 
214 —~ 4B; /Be)(—Be Bi)” (4.49) 


We see that V?/P goes to 0 at By = 0 (w = wr) and to infinity at B, = O 
(w = w,). In Fig. 4.20 assuming (4.46) and (4.47), (V?/P)(@iCywrC 7) is 
plotted vs w for C,/Cr = 1, 3, 10. 

Let us consider another circuit, that shown in Fig. 4.11. We see that this 
consists of a number of resonators coupled together inductively. We might 
draw the equivalent circuits of these resonators as shown in Fig. 4.21. Here 
L and C are the effective inductance and the effective capacitance of the 
resonators without irises. They are chosen so that the resonant frequency 
wo is given by 


wy = WLC (4.50) 
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and the variation of gap susceptance B with frequency is 
0B/dw = 2C 


The arrows show directions of current flow when the currents in the gap 
capacitances are all the same. 
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. 4.20-—-A quantity proportional to (4/8?P) vs w for the conditions of Figs. 4.16 
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Fig. 4.21—A representation of the resonators of Fig. 4.11. 





We can now represent the circuit of Fig. 4.11 by interconnecting the 
circuits of Fig. 4.21 by means of inductances Ly of Fig. 4.22. This gives a 
suitable representation, but one which is open to a minor objection: the 
gap capacitance does not appear across either a shunt or a series arm. 

It is important to notice that there is another equally good representa- 
tion, and there are probably many more. Suppose we draw the resonators as 
shown in Fig. 4.23 instead of as in Fig. 4.21. The inductance L and capaci- 
tance C are still properly given by 4.50 and 4.51. We can now interconnect 
the resonators inductively as shown in Fig. 4.24. 

We should note one thing. In Fig. 4.21, the currents which are to flow in 
the common inductances of Fig. 4.22 flow in opposite directions when the 
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gap currents are in the same directions. In the representation of Fig. 4.23 
the currents which will flow in the common inductances of Fig. 4.24 have 
been drawn in opposite directions, and we see that the currents in the gap 
capacitances flow alternately up and down. In other words, in Fig. 4.24, 


every other gap appears inverted. This can be taken into account by adding 


a phase angle —7 to @ as computed from (4.48). 





Fig. 4.22—The resonators of Fig. 4.11 coupled inductively. 


ofa[Cys 


Fig. 4.23—Another representation of the resonators of Fig. 4.11 
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Fig. 4.24—F igure 4.23 with inductive J added 
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Fig. 4.25—-A 7 x transformation used in connection with the circuit of Fig. 4.24 





Now, the 7 configuration of inductances in a of Fig. 4.25 can be replaced 
by the mw configuration, 6 of Fig. 4.25. Imagine I and II to be connected 
together and a voltage to be applied between them and IIT. We see that 

Li — Be + 2L Va (4.! 
Imagine a voltage to be applied between I and II. We see that 
1 i we =e 1 Le + 2 L ub 
If Liiva K La, then Ly will be nearly equal to L, and Ly» > Ly. 

By means of such a 7 — x transformation we can redraw the equivalent 

circuit of Fig. 4.24 as shown in Fig. 4.26. The series susceptance B, is now 
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that of 1; , and the shunt susceptance is now that of the shunt resonant 
circuit consisting of C2 (the effective capacitance of the resonators) and J». 











Fig. 4.26—The final representation of the circuit of Fig. 4.11. 
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Fig. 4.27—The phase characteristic of the circuit of Fig. 4.11. 
The transverse resonance, By = 0, occurs at a frequency 
or=- ¥\ C2 Le 
Near this frequency the transverse susceptance is given by 


Br = 2( "e(w = wr) 


The longitudinal resonance occurs at a frequency 


wp = V2C2L,L2/(Li + 2L2) 
and near wz, 
B, = Co(w — wz) 
These are just the forms we found in connection with the structure of Fig. 
4.10; but we see that, in the case of the circuit of Fig. 4.11, the effective 
transverse capacitance is always twice the effective longitudinal capacitance 
(Ci/C7 1/2 in Fig. 4.19), and that w, > wr for attainable volume of /). 
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We obtain 6 vs w by adding —z to the phase angle from 4.48, using (4.55) 
and (4.57) in obtaining By and B, . The phase angle vs. frequency is shown 
in Fig. 4.27. As the irises are made larger, the bandwidth, w, — wr , becomes 


larger, largely by a decrease in w, . 
The voltage of interest is that across C,, that is 
From (4.37), (4.44), (4.45), (4.55) and (4.57) we obtain 
V? F = 2 (—BrB,)'? (4.58) 
(4.59) 


, that across the gap. 


V'/P = (+/2/C2)((wr — w)(w — wy 
This goes to infinity at both w = w, and w» = wr. In Fig. 4.28, 
(V2/P)CoV w, wr is plotted vs w. This curve represents the performance of 


all narrow band structures of the type shown in Fig. 4.11. 














fo) 
Wr > WO. 
Fig. 4.28—A quantity proportional to (#?/8*P) for the circuit of Fig. 4.11, plotted vs 


radian frequency w. 


In a structure such as that shown in Fig. 4.11, there is little coupling 
between sections which are not adjacent, and hence the lumped-circuit 
representation used is probably quite accurate, and is certainly more ac- 
curate than in structures such as that shown in Fig. 4.10. 

Other structures could be analyzed, but it is believed that the examples 
given above adequately illustrate the general procedures which can be 
employed. 

4.4 TRAVELING FIELD COMPONENTS 

Filter-type circuits produce fields which are certainly not sinusoidal with 

distance. Indeed, with a structure such as that shown in Fig. 4.11, the elec- 





BELL SYSTEM TECHNICAL JOURNAL 


trons are acted upon only when they are very near to the gaps. It is possible 
to analyze the performance of traveling-wave tubes on this basis®. The chief 
conclusion of such an analysis is that highly accurate results can be obtained 
by expressing the field as a sum of traveling waves and taking into account 
only the wave which has a phase velocity near to the electron velocity. Of 
course this is satisfactory only if the velocities of the other components are 
quite different from the electron velocity (that is, different by a fraction 
several times the gain parameter C). 

As an example, consider a traveling-wave tube in which the electron stream 
passes through tubular sections of radius a, as shown in Fig. 4.29, and is 


acted upon by voltages appearing across gaps of length ¢ spaced L apart. 











EZ Z 
| | 
tke al fe 
Vn-t Vn Vn+1 
Fig. 4.29—A series of gaps in a tube of inside radius a. The gaps are ¢ long and are 
spaced ZL apart. Voltages V,, etc., act across them. 


A wave travels in some sort of structure and produces voltages across the 
gaps such that that across the mth gap, V, is 

r 7 né / 

Vn = Voe (4.60) 


where 7 is any integer. 
We analyze this field into traveling-wave components which vary with 
distance as exp(-j3,,z) where 


Bm = (0 + 2mr)/L (4.61) 


where m is any positive or negative integer. Thus, the total field will be 


>, En= 2, An€ ™ Iolver) (4.62) 


m 


Ym? = Bm? — Bo? (4.63) 


Here /o(y»7) is a modified Bessel function, and ym has been chosen so that 
(4.62) satisties Maxwell’s equations. 


> J. R. Pierce and Nelson Wax, “A Note on Filter-Type Traveling-Wave Amplifiers,” 
Proc. I.R.E., Vol. 37, pp. 622-625, June, 1949, 





FILTER-TYPE CIRCUITS 213 


We will evaluate the coefficients by the usual means of Fourier analysis 
Suppose we let z = 0 at the center of one of the gaps. We see that 


, 


2 


Li2 y 
[ EE*d: = > AmAx1i(ymr) dz 


J—] a) 


(4.64) 


All of the terms of the form E,,£, , p # m integrate to zero because the 
integral contains a term exp(-j24r(p — m)/L)z. 

Let us consider the field at the radius r. This is zero along the surface of 
the tube. We will assume with fair accuracy that it is constant and has a 
value — V/f across the gap. Thus we have also at r = a, 


Li2 x a li2 
/ EE* dz = — (V/?) 2 | Ame Mm? Tyna) dz 
9 m 20 €/2 


J] 


Bm t/2 


e Bm €/2 
—- eé 
(A*)I0(¥m a) ( 
)8 


We can rewrite this 


fo 


L/2 . 
bi! ; * sin (6,,¢/2) 
/ ER’ d¢ = ( A mlo(y¥ma) 

J—L/2 (Bm 2 


By comparison with (4.64) we see that 
Am = —(V/L)(sin (Bn €/2)/ (Bnl/2))(1/Tofya)) (4.67 


This is the magnitude of the mth tield component on the axis. The magnitude 
of the field at a radius r would be /o(yr) times this. 


s 
The quantity 2,,.f is an angle which we will call 6, , the gap angle. Usually 


we are concerned with only a single field component, and hence can merely 
write y instead of y,,. Thus, we say that the magnitude E of the travelling 
field produced by a voltage V acting at intervals L is 


E = —M(V/L) (4.68) 


sin (6g/2) Io(yr) 

M = 4 (4.69) 
(@g/2) Ipo(ya) 

Og = BC (4.70) 

The factor M is called the gap factor or the modulation coefficient*, 

For slow waves, y is very nearly equal to 8, and we can replace yr and ya 

by 8r and Ba. For unattenuated waves, M is a real positive number; and, 


* This factor is often designated by 8, but we have used 8 otherwise 





214 BELL SYSTEM TECHNICAL JOURNAL 


for the slowly varying waves with which we deal, we will always consider 
M as a real number. : 

The gap factor for some other physical arrangements is of interest. At a 
distance y above the two-dimensional array of strip electrodes shown in 
Fig. 4.30 


sin (6g/2) —4, 


fw 
(6¢/2) e (4.71) 
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Fig. 4.30—A series of slots 0, radians long separated by walls L long. 
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Fig. 4.31—A system similar to that of Fig. 4.30 but with the addition of an opposed 
conducting plane. 


If we add a conducting plane a at y = h, as in Fig. 4.31, 


sin (@g/2) sinh y(hk — y) BuTP) 
ee —————— (4.72) 
(6g/2) sinh yh 
For a symmetrical two-dimensional array, as shown in Fig. 4.32, with a 
separation of 2 / in the y direction and the fields above equal to the fields 
below 
sin (@g/2) cosh yy 


M = ——-— — (4.73) 
(6g/2) cosh yh 

4.5 EFFECTIVE FIELD AND EFFECTIVE CURRENT 
In Section 4.4 we have expressed a field component or “effective field” 
in terms of circuit voltage by means of a gap-factor or modulation coeffi- 
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cient M. This enables us to make calculations in terms of fields and currents 
at the electron stream. 


The gap factor can be used in another way. A voltage appears across a 
gap, and the electron stream induces a current at the gap. At the electron 
stream the power P; , produced in a distance L by a convection current 
i with the same 2-variation as the field component considered, acting on the 


field compone xt is 


P, = —Ei*L 
(4.74) 
+(MV)i* 











8g 


Fig. 4.32—-A system of two opposed sets of slots. 
At the circuit we observe some impressed current J flowing against the 
voltage V to produce a power 
P,; = VI* (4.75) 


By the conservation of energy, these two powers must be the same, and we 
deduce that 


[* = Mi* (4.76) 

or, since we take M as a real number 
I = Mi (4.77) 
Thus, we have our choice of making calculations in terms of the beam 
current and a field component or effective field, or in terms of circuit voltage 
and an effective current, and in either case we make use of the modulation 


coefficient M. 
Our gain parameter C* will be 


C3 = (V/L)* M71) /82V 5 
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where V is circuit voltage. We can regard this in two ways. We can think 
of —(V/L)M as the effective field at the location of the current J) , or we 
can think of M°/o as the effective current referred to the circuit. 

If we have a broad beam of electrons and a constant current density J 
we compute (essentially as in Chapter IIT) a value of C*® by integrating 


C3 = (1/88°V)J(V/L)? | M? do (4.78) 


where do is an element of area. We can think of the result in terms of an 
effective field EF, 


(4.79) 


where o is the total beam area, and a total current o/o , or we can think of 
the integral (4.77) in terms of an effective current /,) given by 


Io = Jo | M? do (4.80) 


and the voltage at the circuit. 

Of course, these same considerations apply to distributed circuits. Some- 
times it is most convenient to think in terms of the total current and an 
effective field (as we did in connection with helices in Chapter III) and 
sometimes it is most convenient to think of the field at the circuit and an 
effective current. Either concept refers to the same mathematics. 


4.6 HARMONIC OPERATION 


Of the field components making up £ in (4.62) it is customary to regard 
the m = 0 component, for which 8 = @/L, as the fundamental tield com- 
ponent, and the other components as Harmonic components. These are some- 
times called Hartree harmonics. If the electron speed is so adjusted that the 
interaction is with the m = O or fundamental component we have funda- 
mental operation; if the electron speed is adjusted so that we have interac- 
tion with a harmonic component, we have harmonic operation. 

There are several reasons for using harmonic operation in connection 
with filter-type circuits. For one thing the fundamental component may 
appear to be traveling backwards. Thus, for circuits of the type shown in 
Fig. 4.11, we see from Fig. 4.27 that @ is always negative. Now, in terms of 
the velocity 7 

B=o0/v= 60/L (4.81) 


and if @ is negative, v7 must be negative. However, consider the m = 1 
component 


(4.82) 
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We see that, for this component, v is positive. 

The interaction of electrons with backward-traveling field components 
will be considered later. Here it will merely be said that, in order to avoid 
interaction with waves traveling in both directions, one must avoid having 
the electron speed lie near both the speed of a forward component and the 
speed of a backward component. 

In order that the fundamental component be slow, 6 must be large or L 
must be small. The largest value of @ is that near one edge of the band, where 
6 approaches z. Thus, the largest fundamental value of 8 is r/L, and to make 
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Fig. 4.33—The variation of phase with frequency for the fundamental (0 to m over the 
band) and a spatial harmonic (27 to 3m over the band). The dotted lines show w divided 
by the electron velocity for the two cases. For amplification over a broad band the dotted 
curve should not depart much from the filter characteristic. 


8 large with m = 0 we must make Z small and put the resonators very close 
together. This may be physically difficult or evén impossible in tubes for 
very high frequencies. The alternative is to use a harmonic component, 
for which 8 = (2mm + 6)/L. 

Another reason for using harmonic operation is to achieve broad-band 


operation. The phase of a filter-type circuit changes by w radians between 
the lower cutoff frequency w; and the upper cutoff frequency wT. Now, 
for the wave velocity to be near to the electron velocity over a good part 


of the band, 8 must be nearly a constant times w. Figure 4.33 shows how 
this can be approximately true for the m = 1 component even when it ob- 
viously won’t be for the m = O or fundamental component. Similarly, for 
a filter with a narrower fractional bandwidth and hence a steeper curve of 
6 vs w, a larger value of m might give a nearly constant value of 2. 


t The phase of some filters changes more than this, but they don’t seem good candidates 
for traveling-wave tube circuits. 
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CHAPTER V 
GENERAL CIRCUIT CONSIDERATIONS 
SYNOPSIS OF CHAPTER 


N CHAPTERS IIT AND IV, helices and filter-type circuits have been 

considered. Other slow-wave circuits have been proposed, as, for in- 
stance, wave guides loaded continuously with dielectric material. One may 
ask what the best type of circuit is, or, indeed, in just what way do bad cir- 
cuits differ from good circuits. 

So far, we have as one criterion for a good circuit a high impedance, 
that is, a high value of F* 3°P. If we want a broad-band amplifier we must 
have a constant phase velocity; that is, 8 must be proportional to frequency. 
Thus, two desirable circuit properties are: high impedance and constancy 
of phase velocity. 

Now, F£?/6?P can be written in the form 


2/8P = B/W, 


where W is the stored energy per unit length for a field strength E, and ov, 
is the group velocity. 

One way of making E* 8°P large is to make the stored energy for a given 
field strength small. In an electromagnetic wave, half of the stored energy 
is electric and half is magnetic. Thus, to make the total stored energy for a 
given field strength small we must make the energy stored in the electric 
field small. The energy stored in the electric field will be increased by the 
presence of material of a high dielectric constant, or by the presence of large 
opposed metallic surfaces, as in the circuits of Figs. 4.8 and 4.9. Thus, such 
circuits are poor as regards circuit impedance, however good they may be in 
other respects. 

If the stored energy for a given field strength is held constant, E*/3°P 


may be increased by decreasing the group velocity. It is the phase velocity 


v which should match the electron speed. The group velocity v7, is given in 
terms of the phase velocity by (5.12). We see that the group velocity may 
be much smaller than the phase velocity if —dv/dw is large. It is, for in- 
stance, a low group velocity near cutoff that accounts for the high imped- 
ance regions exhibited in Figs. 4.20 and 4.28. We remember, however, 
that, if the phase velocity of the circuit of a traveling-wave tube changes 
with frequency, the tube will have a narrow bandwidth, and thus the high 
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impedances attained through large values of —dv,/ dw are useful over a nar- 
row range of frequency only. 

If we consider a broad electron stream of current density Jy , the highest 
effective value of E°/6°P, and hence the highest value of C, will be attained 
if there is current everywhere that there is electric field, and if all of the 
electric field is longitudinal. This leads to a limiting value of C, which is 
given by (5.23). There Ao is the free-space wavelength. The nearest practical] 
approach to this condition is perhaps a helix of fine wire flooded inside and 
outside with electrons. 

In many cases, it is desirable to consider circuits for use with a narrow 
beam of electrons, over which the field may be taken as constant. As the 
helix is a common as well as a very good circuit, it might seem desirable 
to use it as a standard for comparison. However, the group velocity of the 
helix differs a little from the phase velocity, and it seems desirable instead 
to use a sort of hypothetical circuit or field for which the stored energy is 
almost the same as in the helix, but for which the group velocity is the same 
as the phase velocity. This has been referred to in the text as a “forced 
sinusoidal field.”’ In Fig. 5.3, (42/8°P)'* for the forced sinusoidal field is 
compared with (£*/s°P)** for the helix. 

Several other circuits are compared with this: the circular resonators of 
Fig. 5.4 (the square resonators of Fig. 5.4 give nearly the same impedance) 
and the resonant quarter-wave and half-wave wires of Figs. 5.6 and 5.7. 
The comparison is made in Fig. 5.8 for three voltages, which fix three phase 
velocities. In each case it is assumed that in some way the group velocity 
has been made equal to the phase velocity. Thus, the comparison is made on 
the basis of stored energies. The field is taken as the field at radius a (cor- 
responding to the surface of the helix) in the case of the forced sinusoidal 
field, and at the point of highest field in the case of the resonators. 

We see from Figs. 5.8 and 5.3 that a helix of small radius is a very fine 
circuit. 

In circuits made up of a series of resonators, the group velocity can be 
changed within wide limits by varying the coupling between resonators, as 
by putting inductive or capacitive irises between them. Thus, even cir- 


cuits with a large stored energy can be made to have a high impedance by 


sacrificing banawidth. 

The circuits of Fig. 5.4 have a large stored energy because of the large 
opposed surfaces. The wires of Fig. 5.6 have a small stored energy asso- 
ciated entirely with ‘‘fringing fields” about the wires. The narrow strips of 
Fig. 5.5 have about as much stored energy between the opposed flat sur- 
faces as that in the fringing field, and are about as good as the half-wave 
wires of Fig. 5.7. 

An actual circuit made up of resonators such as those of Fig. 5.4 will be 


Slap EEE ot te 
eR “ * 


Smee Tagen my tates 
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worse than Fig. 5.8 implies. Thus, there is a decrease of (E?/6?P)'? due to 
wall thickness. Thickening the flat opposed walls of the resonators decreases 
the spacing between the opposed surfaces, increases the capacitance and 
hence increases the stored energy for a given gap voltage. In Fig. 5.9 the 
factor f by which (£°/8°P)"* is reduced is plotted vs. the ratio of the wall 
thickness / to the resonator spacing L. 

There is a further reduction of effective field because of the electrical 
length, @ in radians, of the space between opposed resonator surfaces. 
The lower curve in Fig. 5.10 gives a factor by which (£?/6?P)"* is reduced 
because of this. If the resonator spacing, @, in radians, is greater than 2.33 
radians, it is best to make the opening, or space between the walls, only 
2.33 radians long by making the opposed disks forming the walls very 
thick. 

There is of course a further loss in effective field, both in the helix and in 
circuits made up of resonators, because of the falling-off of the field toward 
the center of the aperture through which the electrons pass. This was dis- 
cussed in Chapter IV. 

Finally, it should be pointed out that the fraction of the stored energy 
dissipated in losses during each cycle is inversely proportional to the Q of 
the circuit or of the resonators forming it. The distance the energy travels 
in a cycle is proportional to the group velocity. Thus, for a given Q the sig- 
nal will decay more rapidly with distance if the group velocity is lowered 
(to increase E* 8°P). Equations (5.38), (5.42) and (5.44) pertain to attenu- 
ation expressed in terms of group velocity. The table at the end of the 
chapter shows that a circuit made up of resonators and having a low enough 
group velocity to give it an impedance comparable with that of a helix can 
have a very high attenuation. 


5.1 Group AND PHASE VELOCITY 

Suppose we use a broad video pulse /(/), containing radian frequencies 
p lying in the range 0 to po , to modulate a radio-frequency signal of radian 
frequency w which is much larger than fp, so as to give a radio-frequency 
pulse f(/) 

5 try, - 
f(t) = e* Fd) (5.1) 

the functions F(/) and f(/) are indicated in Fig. 5.1. 

F(t), which is a real function of time, can be expressed by means of its 


Fourier transform in terms of its frequency components 


F(t) = | A(p)e””' dp (5.2) 


Po 
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Here A(p) is a complex function of p, such that A(— p) is the complex con- 
jugate of A(p) (this assures that F(f) is real). 
With F(t) expressed as in (5.2), we can rewrite (5.1) 


fi) = [ " A(pe*”! ap 5.3) 


Now, suppose, as indicated in Fig. 5.2, we apply the r-f pulse f(t) to the 
input of a transmission system of length L with a phase constant 8 which 


--F (t) 
F(t) 























Fig. 5.1—A radio-frequency pulse varying with time as f(/). The envelope varies with 
time as F(t). The pulse might be produced by modulating a radio-frequency source 
with F(t). 
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Fig. 5.2—When the pulse of Fig. 5.1 is applied to a transmission system of length L 
and phase constant 8(w) (a function of w), the output pulse g(/) has an envelope G(/). 


is a function of frequency. Let us assume that the system is lossless. The 
output g(/) will then be 


Po 
g(t) =| A(pleX@rPPOY ay (5.4) 
a 


We have assumed that f» is much smaller than w. Let us assume that over 
the range w — po tow + po, 8 can be adequately represented by 


q 
B= Bot = p 
Ow 


In this case we obtain 


- Po 
g(t) ine ei! Pot) ! A(p) 2? (tab dw) L) dp 


“Pd 


The envelope at the output is 


Po . 
G(t) = [ A(p) 2 {PR lau) L) dp 


Po 
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By comparing this with (5.2) we see that 


ti) o whe . © 
G(t) = F (: = :) 


In other words, the envelope at the output is of the same shape as at the 
input, but arrives a time 7 later 
0B 
rT=—L 
Ow 


This implies that it travels with a velocity 2, 


* —1 
~~ = L/r = (2°) 
Ow 


This velocity is called the group velocity, because in a sense it is the veloc- 
ity with which the group of frequency components making up the pulse 
travels down the circuit. It is certainly the velocity with which the energy 
stored in the electric and magnetic fields of the circuit travels; we could ob- 
serve physically that, if at one time this energy is at a position x, a time ¢ 
later it is at a position x + al. 

If the attenuation of the transmission circuit varies with frequency, the 
pulse shape will become distorted as the pulse travels and the group velocity 
loses its clear meaning. It is unlikely, however, that we shall go far wrong 


in using the concept of group velocity in connection with actual circuits. 


We have used earlier the concept of phase velocity, which we have desig- 
nated simply as v. In terms of phase velocity, 


(5.11) 


We see from (5.10) that in terms of phase velocity v the group velocity 


/ P aad | 
@® OV 
eet: i- F 
v Ow 


For interaction of electrons with a wave to give gain in a traveling-wave 


Uy is 


tube, the electrons must have a velocity near the phase velocity v. Hence, 
for gain over a broad band of frequencies, v must not change with frequency; 
and if v does not change with frequency, then, from (5.12), v7 = ». 

We note that the various harmonic components in a filter-type circuit 
have different phase velocities, some positive and some negative. The group 





GENERAL CIRCUIT CONSIDERATIONS 


velocity is of course the same for all components, as they are all aspects of 
one wave. Relation (4.61) is consistent with this: 


Bm = (0+ 2mr)/L (4.61) 
1/v1, = OBm/dw = (00/dw)/L (5.13) 

5.2 GAIN AND BANDWIDTH IN A TRAVELING-WaAveE TUBE 
We can rewrite the impedance parameter F*3°P in terms of stored 


energy per unit length W for a field strength £, and a group velocity 2, . 
If W is the stored energy per unit length, the power flow P is 


P= Ws: (5.14) 
and, accordingly, we have 
B/8eP = F/PWo, 
And, for the gain parameter, we will have 
C= (E/8Wr,)"8(1o/8V 0)" (5.16) 


For example, we see from Fig. 4.20 that E° s°P for the circuit of Fig. 4.10 
goes to infinity at the upper cut-off. From Fig. 4.17 we see that 04 dw, 
and hence 1,7, , go to infinity at the upper cutoff, accounting for the infinite 
impedance. We see also that 0@/dw goes to infinity at the lower cutoff, but 
there the slot voltage and hence the longitudinal field also go to zero and 
hence E* 3°P does not go to infinity but to zero instead. 

In the case of the circuit of Fig. 4.11, the gap voltage and hence the longi- 
tudinal field are finite for unit stored energy at both cutoffs. As 04 dw is 
infinite at both cutoffs, V*,P and hence F*/3°P go to infinity at both cut- 
offs, as shown in Fig. 4.28. 

To get high gain in a traveling-wave tube at a given frequency and volt- 
age (the phase velocity is specified by voltage) we see from (5.16) that we 
must have either a small stored energy per unit length for unit longitudinal 
field, or a small group velocity, v, . 

To have amplification over a broad band of frequencies we must have the 


phase velocity v substantially equal to the electron velocity over a broad 


band of frequencies. This means that for very broad-band operation, 1 
must be substantially constant and hence in a broad-band tube the group 
velocity will be substantially the same as the phase velocity. 

If the group velocity is made smaller, so that the gain is Increased, the 
range of frequencies over which the phase velocity is near to the electron 
velocity is necessarily decreased. Thus, for a given phase velocity, as the 
group velocity is made less the gain increases but the bandwidth decreases. 

Particular circuits can be compared on the basis of (E*/6°P) and band- 


enna enh ag an tame gg an 
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width. We have discussed the impedance and phase or velocity curves in 
Chapters III and IV. Field' has compared a coiled waveguide structure with 
a series of apertured disks of comparable dimensions. Both of these struc- 
tures must have about the same stored energy for a given field strength. 
He found the coiled waveguide to have a low gain and broad bandwidth 
as compared with the apertured disks. We explain this by saying that the 
particular coiled waveguide he considered had a higher group velocity than 
did the apertured disk structure. Further, if the coiled waveguide could be 
altered in some way so as to have the same group velocity as the apertured 
disk structure it would necessarily have substantially the same gain and 
bandwidth. 

In another instance, Mr. O. J. Zobel of these Laboratories evaluated the 


effect of broad-banding a filter-type circuit for a traveling-wave tube by 


m-derivation. He found the same gain for any combination of m and band- 
width which made 7 = 7,(00/dw = 0). We see this is just a particular 
instance of a general rule. The same thing holds for any type of broad- 
banding, as, by harmonic operation. 


5.3 A COMPARISON OF CIRCUITS 


The group velocity, the phase velocity and the ratio of the two are param- 
eters which are often easily controlled, as, by varying the coupling between 
resonators in a filter composed of a series of resonators. Moreover, these 
parameters can often be controlled without much affecting the stored energy 
per unit length. For instance, in a series of resonators coupled by loops or 
irises, such as the circuit of Fig. 4.11, the stored energy is not much affected 
by the loops or irises unless these are very large, but the phase and group 
velocities are greatly changed by small changes in coupling. 

Let us, then, think of circuits in terms of stored energy, and regard the 
phase and group velocities and their ratio as adjustable parameters. We 
find that, when we do this, there are not many essentially different configura- 
tions which promise to be of much use in traveling-wave tubes, and it is 
easy to make comparisons between extreme examples of these configura- 
tions. 


5.3a Uniform Current Density throughout Field 

Suppose we have a uniform current density Jo wherever there is longi- 
tudinal electric field. We might approximate this case by flooding a helix 
of very fine wire with current inside and outside, or by passing current 
through a series of flat resonators whose walls were grids of fine wire. 


‘Lester M. Field, “Some Slow-Wave Structures for Traveling-Wave Tubes,” Proc. 
I.R.E., Vol. 37, pp. 34-40, January 1949, 
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In the latter case, if resonators had parallel walls of very fine mesh normal 
to the direction of electron motion there would be substantially no trans- 
verse electric field. All the electric field representing stored energy would 
act on the electron stream. In this case, we would have 


W = Sfx d= (5.17) 


Here d is an elementary area normal to the direction of propagation. W 
given by this expression is the total electric and magnetic stored energy 
per unit length. Where E is less than its peak value, the magnetic energy 
makes up the difference. 

In evaluating E°/) in (5.16) we will have as an effective value 


(ET ete — tf Ed> 


Hence, we will have for the gain parameter C 


ke FE’ d 


a 


(5.19) 


It is of interest to put this in a slightly different form. Suppose Ay is the 
free-space wavelength. Then 
; 
aT ( a 
= (5.20) 
Xo 
where c is the velocity of light 
> 10 2 g 
c = 3 X 10 cm/sec = 3 X 10 m/se 


Further, we have for synchronism between the electron velocity wm 
and the phase velocity 7 
2nVo (5.21) 
Also 
1/V ue 
1/cV w/e 


>ieked 
= 377 ohms 
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Using (5.20), (5.21), (5.22) in connection with (5.19), we obtain 
a 2\ 1/3 
Cm nV u/e Je) 
167cv, 


11.16 (Joro?/2 4)" 


(5.23) 


We have in (5.23) an expression for the gain parameter C in case longi- 
tudinal fields only are present and in case there is a uniform current density 
Jy wherever there is a longitudinal field. 

In a number of cases, as in case of a large-diameter helix, or of a resonator 
with large apertures, the stored energy due to the transverse field is about 
equal to that due to the longitudinal field and C will be 2~"? times as great 
as the value of C given by (5.23). Thus, the value of C given by (5.23), or 
even 2~'% times this, represents an unattainable ideal. It is nevertheless 
of interest in indicating how limiting behavior depends on various parame- 
ters. For instance, we see that if the wavelength A» is made shorter, a higher 
current density must be used if C is not to be lowered; for a constant C 
the current density must be such as to give a constant current through a 
square a wavelength on a side. 

In the table below, some values of C have been computed from (5.23) 
for various wavelengths and current densities. The broad-band condition 
of equal phase and group velocities has been assumed, and the voltage has 
been taken as 1,000 volts. 


\ 
‘ 
3 
Wavelength\. Amp/cm? 
Cm 
J 1 


.060 130 


013 028 


For larger voltages, C will be smaller. C can of course be made larger by 
making the group velocity smaller than the phase velocity. 

Of course, if the electron stream does not pass through some portions of 
the field, C will be smaller than given by (5.23). C will also be less if there 
are “harmonic” field components which do not vary in the z direction as 
exp(jwz/7). 


5.36 Narrow Beams 


Usually, no attempt is made to fill the entire field with electron flow even 
though this is necessary in getting a large value of C for a given current 
density. Instead a narrow electron beam is shot through a region of high 
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field. We then wish to relate the peak field strength to the stored energy in 
comparing various circuits. 

Let us first consider a helically conducting sheet of radius a. The upper 
curve of Fig. 5.3 shows (E2/6°P)!*(v/c)'* vs. 8a. In obtaining this curve it 
was assumed that v < c, so that y can be taken as equal to 8. The field E 
is the longitudinal field at the surface of the helically conducting cylinder. 
Figure 5.3 can be obtained from Fig. 3.4 by multiplying F(ya) by (Jo(ya))*" 
to give a curve valid for the field at r = a. 

The helix has a very smal! circumferential electric field which represents 
“useless” stored energy. The lower curve of Fig. 5.3 is based on the stored 
electric energy of an axially symmetrical sinusoidal field impressed at the 
radius a.t This field has no circumferential component but is otherwise the 

€ 
5 


FORCED - 
SINUSOIDAL 


2 
Aa 
Fig. 5.3—The impedance parameter (2/8?P)'/* compared for a helically conducting 
sheet (A) and a forced sinusoidal field (B) with a group velocity equal to the phase ve 
locity. The helix has a higher impedance because the phase velocity is higher than the 
group velocity by a radio shown to the } power by curve C. 


same as the electric field of the helix (again assuming 7 < c). We can imagine 


{ 


such a field propagating because of an inductive sheet at the radius a, 
which provides stored magnetic energy enough to make the electric and 
magnetic energies equal. The quantity plotted vs. ga is (7 8°P)'* ( ©)" 
(v,/v)'8, 

The forced sinusoidal field is not the field of some particular circuit for 
which a certain group velocity 2, corresponds to a given phase velocity 1 
Hence, the factor (2, 7)'* is included in the ordinate, so that the curve will 
be the same no matter what group velocity is assumed. For the helically 


conducting sheet, a definite group velocity goes with a given phase velocity. 


In Fig. 5.3, the ordinate of the curve for the helically conducting sheet 


does not contain the factor (v,/7)'*. If, for instance, we assume 2, v 


t See Appendix III. 
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in connection with the curve for the forced sinusoidal field, then the two 
ordinates are both (£7/6°P)'* (v/c)"8 and the curve for the sheet is higher 
than that for the forced field because, for the helically conducting sheet, 


fo-----0.707Ao ------ 





(a) (b) 
Fig. 5.4—Pillbox and rectangular resonators. When a number of resonators are coupled 
one to the next, a filter-type circuit is formed. 


» < v for small values of ya. Curve C shows (v/2,)!* 


for the sheet vs. Ba. Aside from the influence of group 


—— 
/ 


velocity, we might have expected the curve for the 
sheet to be a little lower than that for the forced field 
because of the energy associated with the transverse 
electric field component of the sheet. This, however, 
becomes small in comparison with the transverse mag- 
netic component when v <c, as we have assumed. 
Various other circuits will be compared, using 
the impressed sinusoidal field as a sort of standard 








of reference. 








One of the circuits which will be considered is a 


«|e 
2 
9 
12 4 
Qa 
Q 
< 
| 
| 











series of flat resonators coupled together to make a 

filter. Figure 5.4a shows a series of very thin pill- 

OPPRox boxes with walls of negligible thickness. A small cen- 

02a 

Fig. 5.5—Resonators 

with the opposing paral i 

lel surfaces reduced to The diameter is chosen to obtain resonance at a 

lower stored energy and 
increase impedance. 


tral hole is provided for the electron stream, and the 
field & is to be measured at the edge of this hole. 


wavelength A»). Figure 5.4b shows a similar series 
of flat square resonators. 
For the round resonators it is found that* 


(E2/82P\3 = 5.36 (v/c)"3 (v/v) 
for the square resonators* 

(E2/82P)'3 = 5.33 (v/c)"3 (v/09)"8 
For practical purposes these are negligibly different. 


* See Appendix IIT. 
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Suppose we wanted to improve on such circuits by reducing the stored 
energy. An obvious procedure would be to cut away most of the flat opposed 
surfaces as shown in Fig. 5.5. This reduces the energy stored between the 
resonator walls, but results in energy storage outside of the open edges, 
energy associated with a “fringing field.”’ 

Going to an extreme, we might consider an array of closely spaced very 
fine wires, as shown in Fig. 5.6. Here there are no opposed flat surfaces, 
and all of the electric field is a fringing field; we have 
reached an irreducible minimum of stored energy in 
paring down the resonator. 

The structure of Fig. 5.6 has not been analyzed 
exactly, but that of Fig. 5.7 has. In Fig. 5.7, we have 
an array of fine, closely spaced half-wave wires be- 
tween parallel planes.* This should have roughly 
twice the stored energy of Fig. 5.6, and we will esti- 
mate (4° /@°P)'* for Fig. 5.6 on this basis. We obtain 
in Appendix III: 


For the half-wave wires, Fig. 5.6—Quarter-wave 
wires, Which have a min- 


(E2/B2P)"3 = 6.20 (v/t,)" (5.25) imum of stored energy. 
and hence for the quarter-wave wires, approximately 
(E?/8P)'* = 7.81 (v/2,)'4 
As we have noted, (2c), which appears in the expression for (/ 
for the sinusoidal field impressed at radius a and in (5.24) and (5.2 


qr c 


} 





| 
| J 
[ | 


> ---p 








| 


Fig. 5.7—Half-wave wires between parallel planes. The stored energy can be calculated 
for this configuration, assuming the wires to be very fine. The circuit does not propagate a 
wave unless added coupling is provided. 


function of the accelerating voltage. Figure 5.8 makes a comparison be- 
tween the sinusoidal field impressed at a radius a, curve A; the flat resona- 
tors, either circular or square, B; the half-wave wires, C; and the quarter- 

* There is no transverse magnetic wave propagation along such a circuit unless extra 


coupling or loading is provided. Behavior of nonpropagating circuits in the presence of an 
electron stream is considered in Section 4 of Chapter XTV. 
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wave wires C’. In all cases, it is assumed that the coupling is so adjusted as 
to make (v,/v) = 1 (broad-band condition). 

What sort of information can we get from the curves of Fig. 5.8? Con- 
sider the curves for 1,000 volts. Suppose we want to cut down the opposed 
areas of resonators, as indicated in Fig. 5.5, so as to make them as good as 
half-wave wires (curve C). The edge capacitance in Fig. 5.5 will be about 
equal to that for quarter-wave wires (curve C’). Curve C’ is about 3.7 times 
as high as curve B, and hence represents only about (1/3.7)* = .02 as much 
capacitance. If we make the opposed area in Fig. 5.5 about .01 that in Fig. 
5.4a or b, the capacitance* between opposed surfaces will equal the edge 
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Fig. 5.8—Comparisons in terms of impedance parameter of an impressed sinusoidal 
field (4), circular resonators (B), half-wave wires (C) and quarter-wave wires (C’) assuming 
the group and phase velocities to equal the electron velocity. The radius of the impressed 
sinusoidal field is a. 


capacitance and the total stored energy will be twice that for quarter-wave 
wires, or equal to that for half-wave wires. This area is shown approxi- 
mately to scale relative to Fig. 5.4 in Fig. 5.5. Thus, at 1,000 volts the 
resonant strips of Fig. 5.5 are about as good as fine, closely spaced half- 
wave wires. 

Suppose again that we wish at 1,000 volts to make the gain of the reso- 
nators of Fig. 5.4 (or of a coiled waveguide) as good as that for a helix with 
Ba = 3. For 8a = 3 the helix curve A is about 3.2 times as high as the resona- 


* This takes into account a difference in field distribution—that in Fig. 5.4b. 
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tor curve B. As (E*/6*P)"8 varies as (v/v,)'*, we must adjust the coupling 
between resonators so as to make 


v% = v/(3.2)* = .031 1 


in order to make (£°/8°P)"* the same for the resonators as for the helix. 
From (5.12) we see that this means that a change in frequency by a frac- 
tion .002 must change v by a fraction .06. Ordinarily, a fractional variation 
of v of +.03 would cause a very serious falling off in gain. At 3,000 me the 
total frequency variation of .002 times in v would be 6 me. This is then a 
measure of the bandwidth of a series of resonators used in place of a helix 
for which Ba = 3 and adjusted to give the same gain. 


1.0 





ad 
































Fig. 5.9—The factor f by which (E£*/s?P)*!* for a series of resonators such as those of 
Fig. 5.4 is reduced because of wall thickness ¢, in relation to gap spacing J 


5.4 PuysicaL LIMiraTIONS 

In Section 3.3b the resonators were assumed to be very thin and to have 
walls of zero thickness. Of course the walls must have finite thickness, and 
it is impractical to make the resonators extremely thin. The wall thickness 
and the finite transit time across the resonators both reduce /2/G°P. 
54a Effect of Wall Thickness 


Consider the resonators of Fig. 5.4. Let L be the spacing between resona- 
tors (1/L resonators per unit length), and ¢ be the wall thickness. Thus, the 


gap length is (L — /). Suppose we keep / and the voltage across each 
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resonator constant, so as to keep the field constant, but vary ¢. The capaci- 
tance will be proportional to (L — /)~ and, as the stored energy is the 
voltage squared times the capacitance, we see that (E?/8?P) 8 will be re- 
duced by a factor f, 


f=(1— ¢/L)" (5.27) 


The factor f is plotted vs. ¢/Z in Fig. 5.9. 


10 12 14 16 18 20 22 24 
TRANSIT ANGLE IN RADIANS 


Fig. 5.10—The lower curve shows the factor by which £?/6?P is reduced by gap length, 
6 in radians. If the gap spacing is greater than 2.33 radians, it is best to make the gap 2.33 
radians long. Then the upper curve applies. 


5.46 Transit Time 


\s it is impractical to make the resonators infinitely thin, there will be 
s ° 
some transit angle @, across the resonator, where 


6, = Bl (5.28) 


Here ¢ is the space between resonator walls, or, the length of the gap. 
If we assume a uniform electric field between walls, the gap factor M, 
that is, the ratio of peak energy gained in electron volts to peak resonator 
voltage, or the ratio of the magnitude of the sinusoidal field component 
produced to that which would be produced by the same number of infinitely 
thin gaps with the same voltages, will be (from (4.69) with r = a) 


sin (6,/2) 


M = : 
6,/2 


(5.29) 
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For a series of resonators 6, long with infinitely thin walls E*/6°P will be 
less than the values given by (5.24) and (5.25) by a factor M?*. This is 
plotted vs. 4 in Fig. 5.10. 


5.4c Fixed Gap Spacing 


Suppose it is decided in advance to put only one gap in a length specified 
by the transit angle @,. How wide should the gap be made, and how much 
will E?/g°P be reduced below the value for very thin resonators and infi- 
nitely thin walls? 

Let us assume that all the stored energy is energy stored between parallel 
planes separated by the gap thickness, expressed in radians as @ or in dis- 
tance as L 


6, = Be 
0, = BL 
Here ¢ is the gap spacing and ZL is the spacing between resonators 
From Section 4.4 of Chapter IV we see that if V is the gap voltage, the 
field strength E is given by 
E = MV/L 
The stored energy per unit length, W, will be 
W = W.V?/tL (5.30) 


Here W, is a constant depending on the cross-section of the resonators. 
Thus, for unit field strength, the stored energy will be 


W = WoL/tM? 
(5.31) 
W = W (6: 6 4)(O, 2)? sin? (0, 2) 


We see that W, is merely the value of W When 6, = 6, and @, = 0, or, 
for zero wall thickness and very thin resonators. Thus, the ratio W/W» re- 


lates the actual stored energy per unit length per unit field to this optimum 
stored energy for resonators of the same cross section. 

For 0, < 2.33, W/W, is smallest (best) for 6, = @, (zero wall thickness). 
For larger values of @,, the optimum value of @, is 2.33 radians and for 
this optimum value 


(W/W) = (1.450/0,)"8 (5.32) 


If 6, < 2.33, it is thus best to make @, = 6,. Then (E/8?P)"* is re- 
duced by the factor [sin(6/2)/(@/2)|**, which is plotted in Fig. 5.10. If 
6, > 2.33, it is best to make 6 = 2.33. Then (F?/s?P)"* is reduced from the 
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value for thin resonators with infinitely thin walls by a factor given by 
(5.32), which is plotted vs. 6, in Fig. 5.10. 

If there are edge effects, the optimum gap spacing and the reduction in 
(F2/8?P)"> will be somewhat different. However, Fig. 5.10 should still be a 
useful guide. 

In case of wide gap separation (large @,), there would be some gain in 
using reentrant resonators, as shown in Fig. 4.11, in order to reduce the 
capacitance. How good can such a structure be? Certainly, it will be worse 
than a helix. Consider merely the sections of metal tube with short gaps, 
which surround the electron beam. The shorter the gaps, the greater the 
capacitance. The space outside the beam has been capacitively loaded, 
which tends to reduce the impedance. This capacitance can be thought of 
as being associated with many spatial harmonics in the electric field, which 
do not contribute to interaction with the electrons. 


5.5 ATTENUATION 


Suppose we have a circuit made up of resonators with specified unloaded 
Q.+ The energy lost per cycle is 


W, = 24rWs/Q (5.33) 
In one cycle, however, a signal moves forward a distance L, where 
(5.34) 


we will call 2a, is 


whence 


(5.36) 


var 
“\ nN gv 


So defined, @ is the attenuation constant, and the amplitude will decay 


along the circuit as exp(—az). 
The wavelength, A, is given by 
A= vf 
The loss per wavelength in db is 
db, wavelength = 20 logy exp(a@d) 
db/wavelength 


v ‘9g 


7 Disregarding coupling losses, the circuit and the resonantors will both have this 
same VU. 
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We see that, for given values of v and Q, decreasing the group velocity, 


which increases -°/8°P, also increases the attenuation per wavelength. 


5.5a Attenuation of Circuits 


lor various structures, Q can be evaluated in terms of surface resistivity, 


R, the intrinsic resistance of space, /u/e = 377 ohms, and varous other 
parameters. For instance, Schelkunoff? gives for the Q of a pill-box resona- 
tor 


_ 1.20(-V/u/e/R) 
is 1+ a/h 


Here a is the radius of the resonator and h is the height. If we express the 
radius in terms of the resonant wavelength A» (a = 1.2d9 ), we obtain 


(5.39) 


Tr | / ‘e/ R)(v/c) p 
7 7 (5.40) 
(1+ h/a)n 
Here » is the number of resonators per wavelength (assuming the walls 
separating the resonators to be of negligible thickness); thus 


n= h A = (h Ao) fe v) (5.41) 
From (5.40) and (5.38) we obtain for a series of pill-box resonators 
db/wavelength = 8.68(R/ YVw/e)(e v,)1 + bh a)n (5.42) 


In Appendix IIT an estimate of the Q of an array of tine half-wave paral 
lel wires is made by assuming conduction in one direction with a surface 
resistance R. On this basis, Q is found to be 


Q = (V w/e R)(1 Cc) (5.43) 
and hence 


db/wavelength = 27.3(R/V/u/e)(c/v,) (5.44) 


lor non-magnetic materials, surface resistance varies as the square root 
of the resistivity times the frequency. The table below gives R for copper 
and db, wavelength for pill-box resonators for a < 1 (5.42) and for wires 
(5.44) for several frequencies 
f, me R, Ohms (db/wavelength 
Pill-box Resonators 
3,000 0142 3.3 X 10-n 


10,000 0260 6.0 XK 10°4n 
30,000 0450 10.4 & 10-‘n 


In Section 3.3b a circuit made up of resonators, with a group velocity 
031 times the phase velocity, was discussed. Suppose such a circuit were 


2 Electromagnetic Waves, S. A. Schelkunoff, Van Nostrand, 1943. Page 269 
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used at 1,000 volts (c/» 


16.5), were 40 wavelengths long, and had three 
copper resonators per wavelength. The total attenuation in db is given below 


f,mc Attenuation, db 
3,000 21 

10,000 38 

30,000 67 
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CHAPTER VI 


THE CIRCUIT DESCRIBED IN TERMS OF 
NORMAL MODES 


SYNOPSIS OF CHAPTER 


N CHAPTER II, the field produced by the current in the electron stream, 
which was assumed to vary as exp (—Iz), was deduced from a simple 
model in which the electron stream was assumed to be very close to an ar- 
tificial line of susceptance B and reactance X per unit length. Following 
these assumptions, the voltage per unit length was found to be that of 
equation (2.10) and the field E in the z direction would accordingly be T 
times this, or 
=. a 7 
E= T? it r? (6.1) 
Here we will remember that T is the natural propagation constant of 
the line, and K is the characteristic impedance. 
We further replaced K by a quantity 


F?/@P = 2K (6.2) 
where E is the field produced by a power flow /, and £ is the phase constant 
of the line. For a lossless line, T, is a pure imaginary and 

@= —-Ti (6.3) 
From (6.1) and (6.2) we obtain 
r 


is 
x1 1 (6.4) 


i(E"/B° P) . 


> — T%) 

To the writer it seems intuitively clear that the derivation of Chapter 
II is correct for waves with a phase velocity small compared with the 
velocity of light, and that (6.4) correctly gives the part of the field asso- 
ciated with the excitation of the circuit. However, it is clear that there are 
other field components excited; a bunched electron stream will produce a 
field even in the absence of a circuit. Further, many legitimate questions 
can be raised. For instance, in Chapter II capacitive coupling only was 
considered. What about mutual inductance between the electron stream 
and the inductances of the line? 
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The best procedure seems to be to analyze the situation in a way we know 
to be valid, and then to make such approximations as seem reasonable. One 
approximation we can make is, for instance, that the phase velocity of the 
wave is quite small compared with the speed of light, so that 

| Ts |? >> 60 = @/c)? (6.5) 

In this chapter we shall consider a lossless circuit which supports a group 
of transverse magnetic modes of wave propagation. The finned structure of 
Fig. 4.3 is such a circuit, and so are the circuits of Figs. 4.8 and 4.9 (assum- 
ing that the fins are so closely spaced that the circuit can be regarded as 
smooth). It is assumed that waves are excited in such a circuit by a current 
in the z direction varying with distance as exp (— Iz) and distributed normal 


to the z direction as a function of x and y,J (x, y). Such a current might 


arise from the bunching at low signal levels of a broad beam of electrons 
confined by a strong magnetic field so as not to move appreciably normal 
to the z direction. 

The structure considered may support transverse electric waves, but these 
can be ignored because they will not be excited by the impressed current. 

In the absence of an impressed current, any field distribution in the struc- 
ture can be expressed as the sum of excitations of a number of pairs of nor- 
mal modes of propagation. For one particular pair of modes, the field dis- 
tribution normal to the z direction can be expressed in terms of a function 
m,(x, y) and the field components will vary in the z direction as exp(+T',,z). 
Here the + sign gives one mode of the pair and the — sign the other. If 
I’, is real the mode is passive; the field decays exponentially with distance, 
If I’, is imaginary the mode is active; the field pattern of the mode propa- 
gates without loss in the z direction. 

An impressed current which varies in the z direction as exp(—I'z) will 
excite a field pattern which also varies in the z direction as exp(—TI'z), and 
as some function of x and y normal to the z direction. We may, if we wish, 
regard the variation of the field normal to the z direction as made up of a 
combination of the field patterns of the normal modes of propagation, the 
patterns specified by the functions 7,(x, v). Now, a pattern specified by 
m,(“, y) coupled with a variation exp(+T,z) in the z direction satisties 
Maxwell’s equations and the boundary conditions imposed by the circuit 
with mo impressed current. If, however, we assume the same variation with 
x and y but a variation as exp(—Tz) with 2, Maxwell’s equations will be 
satisfied only if there is an impressed current having a distribution normal 
to the z direction which also can be expressed by the function 7,,(x, v). 

Suppose we add up the various forced modes in such relative strength 
and phase that the total of the impressed currents associated with them is 
equal to the actual impressed current. Then, the sum of the fields of these 
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modes is the actual field produced by the actual impressed current. The 
field is so expressed in (6.44) where the current components /, are defined 
by (6.36). 

If it is assumed that there is only one mode of propagation, and if it is 
assumed that the field is constant over the electron flow, (6.44) can be put 
in the form shown in (6.47). For waves with a phase velocity small compared 
with the velocity of light, this reduces to (6.4), which was based on the simple 
circuit of Fig. 2.3. 

Of course, actual circuits have, besides the one desired active mode, an 
infinity of passive modes and perhaps other active modes as well. In Chapter 
VII a way of taking these into account will be pointed out. 

Actual circuits are certainly not lossless, and the fields of the helix, for 
instance, are not purely transverse magnetic fields. In such a case it is per- 
haps simplest to assume that the modes of propagation exist and to cal- 
culate the amount of excitation by energy transfer considerations. This has 
been done earlier’, at first subject to the error of omitting a term which 
later? was added. In (6.55) of this chapter, (6.44) is reexpressed in a form 
suitable for comparison with this earlier work, and is found to agree. 

Many circuits are not smooth in the z direction. The writer believes that 
usually small error will result from ignoring this fact, at least at low signal 


levels. 


6.1 EXCITATION OF TRANSVERSE MAGNETIC MODES OF PROPAGATION BY 
A LONGITUDINAL CURRENT 


We will consider here a system in which the natural modes of propagation 
are transverse magnetic waves. The circuit of Fig. 4.3, in which a slow wave 
is produced by finned structures, is an example. We will remember that the 
modes of propagation derived in Section 4.1 of Chapter IV were of this 


type. We will consider here that any structure the circuit may have (fins, 


for instance) is fine enough so that the circuit may be regarded as smooth 


in the z direction. 

Any transverse electric modes which may exist in the structure will not 
be excited by longitudinal currents, and hence may be disregarded. 

The analysis presented here will follow Chapter X of Schelkunoff’s 
Klectromagnetic Waves. 

The divergence of the magnetic field H is zero. As there is no z component 
of field, we have 

1J. R. Pierce, “Theory of the Beam-Type Traveling-Wave Tube,” Proc. /.R.E., Vol 
35, pp. 111-123, February, 1947 


2 J. R. Pierce, “Effect of Passive Modes in Traveling-Wave Tubes,” Proc. [.RE 
Vol. 36, pp. 993-997, August, 1948. 
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OH, OH, 
—/’ =(Q f 
Ox + oy (6.6) 


This will be satisfied if we express the magnetic field in terms of a “stream 
function”’, r 


On 
H, es ‘ 
: (6.7) 


On 
Ox 
x can be identified as the z component of the vector potential (the vector 


potential has no other components). 
We will assume 7 to be of the form 


r= (x, ye" (6.9) 
Here # (x, y) is a function of x and y only, which specifies the field dis- 
tribution in any 4, y plane. 
We can apply Maxwell’s equations to obtain the electric fields 


OH, OH, =e 


quer, 


H, = (6.8) 


oy 02 
Using (6.7) and (6.8), and replacing differentiation with respect to z by 
multiplication by —I’, we find 


(6.10) 
Similarly 


(6.11) 


We see that in an x, y plane, a plane perpendicular to the direction of propa- 
gation, the field is given as the gradient of a scalar potential V 


V = (—jT'/we)r (6.12) 

This is because we deal with transverse magnetic waves, that is, with waves 

which have no longitudinal or z component of magnetic field. Thus, a closed 

path in an x, y plane, which is normal to the direction of propagation, will 

link no magnetic flux, and the integral of the electric field around such a 
path will be zero. 

We can apply the curl relation and obtain E, 
OH, OH; 
: Ov 


- 


j (ar , an 
E,=2 + 
we (* t 0 *) 


= jwekz 
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Applying Maxwell’s equations again, we have 


j a9(#e , OR 
we Oy \ Ox? oy" 
This is certainly true if 

an On 


bb EO at\s 
= fag ay (I + Bo)# 
Bo = wV pe = w/c 
We find that this satisfies the other curl £ relations as well. 
From (6.16) and (6.14) we see that 


E, = (—j/we)(T? + Bo)#(x, ye ™ (6.18) 


For a given physical circuit, it will be found that there are certain real 
functions #,(x, y) which are zero over the conducting boundaries of the 
circuit, assuring zero tangential field at the surface of the conductor, and 
which satisfy (6.16) with some particular value of ', which we will call T’,, . 
Thus, as a particular example, for a square waveguide of width W some 
(but not all) of these functions are 


t.(x, y) = cos (nry/W) cos (nrx/W) (6.19) 


where » is an integer. We see from (6.10), (6.11) and (6.18) that this makes 
E,, E, and E, zero at the conducting walls x = +W/2, y = +W/2. 

Each possible real function #,(x, y) is associated with two values of 
I’, , one the negative of the other. The I’,’s are the natural propagation 
constants of the normal modes, and the #,’s are the functions giving their 
field distribution in the x, y plane. The #,’s can be shown to be orthogonal, 
at least in typical cases. That is, integrating over the region in the x, y 
plane in which there is field 


[| a(x, y) #.(x, y) dx dy = () 


nw~zm 


(6.20) 


For a lossless circuit the various field distributions fall into two classes: 
those for which I, is imaginary, called active modes, which represent 
waves which propagate without attenuation; and those for which I’, is 
real, which change exponentially with amplitude in the z direction but do 
not change in phase. The latter can be used to represent the disturbance 
in a waveguide below cutoff frequency, for instance. 
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If I’, is imaginary (an active mode) the power flow is real, while if I, is 
real (a passive mode) the power flow is imaginary (reactive or ‘“wattless”’ 
power). 

The spatial distribution functions #, and the corresponding propagation 
constants I’,, are a means for specifying the electrical properties of a physical 
structure, just as are the physical dimensions which describe the physical 
structure and determine the various #,’s and T’,’s. In fact, if we know the 
various 7,,’s and I’,,’s, we can determine the response of the structure to an 
impressed current without direct reference to the physical dimensions. 

In terms of the #,’s and I’,’s, we can represent any unforced disturbance 
in the circuit in the form 


> 4.x, y[Ane + Bre] (6.21) 


Here A,, is the complex amplitude of the wave of the mth spatial distribu- 
tion traveling to the right, and B, the complex amplitude of the wave of 
the same spatial distribution traveling to the left. 

It is of interest to consider the power flow in terms of the amplitude, /,, 
or B,. We can obtain the power flow P by integrating the Poynting vector 
over the part of the x, y plane within the conducting boundaries 


Pu! | EXH* ds 


2. 
1 ee 
p=! [ | (E,H* — E,H*) dx dy 


By expressing the fields in terms of the stream function, we obtain 
‘ cod iT, e - Ot, ps4 
P= A,An : ; + | - dx dy (6.23) 
2we x Oy : 


We can transform this by integrating by parts (essentially Green’s 
theorem). Thus 


dx .- * dx (6.24) 


a at, a4, . , ) 2) ‘ are, 
Je; OR O% x |x Ox” 


Here x; and x2, the limits of integration, lie on the conducting boundaries 
where 7, = 0, and hence the first term on the right is zero. Doing the same 
for the second term in (6.23), we obtain 


~ iT ; 
2we | 
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By using (6.16), we obtain 


ny G *) (r?, + 62) | / (#,)? dx dy (6.26) 
cWe 


It is also of interest to express the z component of the nth mode, E; 


“2m 3 


explicitly. For the wave traveling to the right we have, from (6.18), 


Em = Ay ey (I, + B)#,(x, y) (6.27) 
WE 


Let the field at some particular position, say, x = y = 0, be Eno. Then 
jweE sno 
A, = 72) a! , (6.28) 
QI n + Bo) #, (0, 0) 
and from (6.26) 


P (E E *) — jwel’, [| ” ( 2 : 
nn 2/ a 2 l#,(x, y)|° dx dy 
‘ ) 2r,.(0, OV(TS, + Bo) JU T,(X, v)} dx dy 


We can rewrite this 


E.no Ezno* _ 2#°.(0, O)(T, + 85) 


(77 a os (6.30) 
site —jwel’,(—T,) [| #,(x, v)} dx dy 
For an active mode in a lossless circuit, I’, is a pure imaginary, and the 
negative of its square is the square of the phase constant. Thus, for a par- 
ticular mode of propagation we can identify (6.30) with the circuit parame- 
ter E?/6°P which we used in Chapter IT. 

Let us now imagine that there is an impressed current J which flows in 
the z direction and has the form 


J=J (x, ye (6.31) 


According to Maxwell’s equations we must have 
oH OH a 
- y —_ - 2s = jwek, + P (6.32) 
Ox oy : 

Now, we will assume that the fields are given by some overall! stream func- 

tion # which varies with x and y and with z as exp(—TI2z). 
In terms of this function 7, /7,, H, and F,, E 
(6.7), (6.8), (6.10), (6.11). However, the relation used in obtaining F, is 


will be given by relations 


4 


not valid in the presence of the convection current. Instead of (6.16) we 
have 


= we, + J 


Ox O* 


oH, eo OH, 


(6.33) 


] On On ; j 
~or ts aes A J 
we \ Ox" oy’ we 
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Again applying the relation 


we obtain 
9 y ‘. 
a ft ae we ol 4 De = J (6.34) 
= oF 
We will now divide both x and J into the spatial distributions charac- 
teristic of the normal unforced modes. 
Let 


I(x, y) = XS Tntalx, y) (6.35) 


n 


I I(x, y)#n(x, y) dx dy 
Jn = (6.36) 


/ [#,(x, y)}’ dx dy 





This expansion is possible because the z,’s are orthogonal. Let 


t#=6" > Crta(x, y) (6.37) 


Here there is no question of forward and backward waves; the forced ex- 
citation has the same 2-distribution as the forcing current. 
For the mth component, we have, from (6.16), 


d fn (a, y) @ ftn(, y) = Ted + Bo) tnx, y) (6.38) 


Ox” oy" 


From (6.34) we must also have 


” (dtsts y) 4 HAals, 2) 


Ox" oy" 
=> Cir + Bs) #a(x, y) i ee n(x, y) 


Accordingly, we must have 


(6.40) 


From (6.33) and (6.34) we see that 


E, = J (r+ @)s 
WE 
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—i(T” + 85) #,(x, y)In 
oP? — 1) 


es —iT” + Bo Bo) ob ftn(x, y)In (6.44) 


nN we T? on Fe 





(6.43) 


6.2 COMPARISON WITH RESULTS OF CHAPTER II 


Let us consider a case in which there is only one mode of propagation, 
characterized by #,(x, ” I',, and a case in which the current flows over a 
region in which #;(x, y) has a constant value, say, #:(0, 0). This corre- 
sponds to the case of si transmission line which was discussed in Chapter 
IT. 

We take only the term with the subscript 1 in (6.44) and (6.30). Combin- 
ing these equations, we obtain for the field at 0, 0 


2 + in ff ‘ t if / 
z, = (El P)T* + phy Tits fy ee yt - ‘y — (6.45) 


(Lt + B35) 2#,(0, 0) © 
We have from (6.36) 





r (0, 0) 





"iS (a(x, y)[? dx dy 


From (6.45) and (6.46) we obtain 


(I y 7 - BT 1(E 1/8 ‘P) 
2 


ma = Je (6.47) 
(Tj + 83)(r ~ i) 

Let us compare this with (6.4), which came from the transmission line 
analogy of Chapter II, identifying EZ. and J with E and 7. We see that, 
for slow waves for which : 


E, = 


Bo<«K | Ti| (6.48) 
Bo «| T"| (6.49) 
(6.47) becomes the same as (6.4). It was, of course, under the assumption 
that the waves are slow that we obtained (2.10), which led to (6.4). 
6.3 EXPANSION REWRITTEN IN ANOTHER FORM 


Expression (6.44) can be rewritten so as to appear quite different. We 
can write 


+ p= —Ta+Tn+ Bo 
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Thus, we can rewrite the acres for E, as 


n 


(6.50) 
+ (7/we) ys n(x, vIn 
The second term in the brackets is just j/we times the impressed current, 
as we can see from (6.35). The first term can be rearranged 


(—j/we)(T%, + B5)J, 


(—j/e(t®, + si) || #a(x, y)J(x, y) dedy 51) 


/ [#,(x, v)]° dx dy 


Referring back to (6.29), let Y, be twice the power P,, carried by the 
unforced mode when the field strength is 


Eno | ne 
Further, let us choose thez,,’s so that, at some specified position, x 


(0,0) = 1 


: —jwel’y, [fi ( ))? dx d 
‘=P ag | lt ee ee 


Using this in connection with (6.51), we obtain 


Then 


~ 


P,.#,(x, y) Jf +. (x, y)J (x, y) dx dy 


n V,(0 s n =— ['*) 
t we) J (x, y) 


An expression for the forced field in terms of the parameters of the nor- 
mal modes was given earlier’. In deriving this expression, the existence of 
a set of modes was assumed, and the field at a point was found as an in- 
tegral over the disturbances induced in the circuit to the right and to the 
left and propagated to the point in question. Such a derivation applies for 
lossy and mixed waves, while that given here applies for lossless transverse- 
magnetic waves only. 
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The earlier derivation! leads to an expression identical with (6.55) except 
that V7 appears in place of ¥,. In this earlier derivation a sign was im- 
plicitly assigned to the direction of flow of reactive power (which really 
doesn’t flow at all!) by saying that the reactive power flows in the direction 
in which the amplitude decreases. If we had assumed the reactive power to 
flow in the direction in which the amplitude increases, then, with the same 


* @.6 » ° ° * * 
definition of WV, , for a passive mode ¥, would have been replaced by —¥,, 


which is equal to W, (for a passive mode, WV, is imaginary). 

In deriving (6.55), no such ambiguity arose, because the power flow was 
identified with the complex Poynting vector for the particular type of wave 
considered. In any practical sense, V is merely a parameter of the circuit, 
and it does not matter whether we call Im ¥ reactive power flow to the right 
or to the left. 

The existence of a derivation of (6.55) not limited in its application to 
lossless transverse magnetic waves is valuable in that practical circuits often 
have some loss and often (in the case of the helix, for instance) propagate 
mixed waves. 


6.4 ITERATED STRUCTURES 


Many circuits, such as those discussed in Chapter IV, have structure in 
the z direction. Expansions such as (6.55) do not strictly apply to such struc- 
tures. We can make a plausible argument that they will be at least useful 
if all field components except one differ markedly in propagation constant 
from the impressed current. In this case we save the one component which 
is nearly in synchronism with the impressed current and hope for the best. 
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APPENDIX III 
STORED ENERGIES OF 
CIRCUIT STRUCTURES 
A3.1 ForcED SINUSOIDAL FIELD 


If v < c, the field can be very nearly represented inside the cylinder of 
radius a by 


 Io(Br) ~j. FE Io(Br) —52 
V — J cls oath eg ve Q. -_ . ial Jpz 
° 7,(Ba) 5B I(Ba) © (1) 
and outside by 
* is Ko(yr) - jp 
V = J ‘ies 
: A(ya) . 
Inside 


dV _ 2 Mi(6r) 
dr" I,(Ba) 
av 


az 


oe HPs Vo 


‘ T(Br) — jBz 
a «ee oy 
IBF (Ba) ° 

Outside 


av K,(8r) —s: 
sa — seal é 


K,(Ba) 


Oz Jt K. (Ba) 


aV .» Kol{ ~ ips 
Ve 3K 3r) oy, 


Because there is a sinusoidal variation in the z direction, the average stored 
electric energy per unit length will be 


Ws = (5)( (Ey max)" + (Esmaz)'|(2ar dr) (7) 


Here Ey maz and Fz maz are Maximum values at r = a. The total electric 
plus magnetic stored energy will be twice this. This gives 
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re(ya) E ed Iols 7 Ko K» = = E 


27? Ki 


, _ weya E 


(E’/6' P)'® = 


A3.2 Pitt-Box RESONATORS 


Schelkunoff gives on page 268 of Electromagnetic Waves an expression 
for the peak electric energy stored in a pill-box resonator, which may be 
written as 


135 re @hEe 

Here a is the radius of the resonator and / is the axial length. For a series 
of such resonators, the peak stored electric energy per unit length, which is 
also the average electric plus magnetic energy per unit length, is 

W= 135 7re7?F (10) 
For resonance 

= 1.2o/r 
Whence 
= 0618 A’p EL 
And 
(E2/2P)'3 = 5.36 (2 

The case of square resonators is easily worked out. 
A3.3 PARALLEL WIRES 


Let us consider very fine very closely spaced half-wave parallel wires with 
perpendicular end plates. 

If z is measured along the wires, and y perpendicular to z and to the 
direction of propagation, the field is assumed to be 


dar 


E. cos Bre” seg 
No 


E, E sin Bxe™*" 


Here the + sign applies for y < 0 and the — sign for y > 0. We will then 
find that 
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and 
(E2/s?P)'3 = 6.20 (v/0,)'8 (16) 
The surface charge density o on one side of the array of wires (say, y > 0) 
is given by the y component of field at y = 0. 
) 


a 2r z 
o = cH, = cE sin Bx cos — 2 (17) 
0 


This is related to the current / (flowing in the z direction) per unit distance 
in the x direction by 
ol Oo 
a a: eee (18) 
Oz dl 
From (18) and (17) we obtain for the current on one side of the array 


Jove 


; ee 
E sin Bx sin — z (19) 


dr 
If we use the fact that wro/ 27 = ¢ and ce = p/e, We obtain 
a _ Ie 
sin Bx sin - (20) 
ay M/e€ 0 


If R is the surface resistivity of either side (vy > 0, y < 0) of the wires, when 
the wires act as a resonator (a standing wave) the average power lost per 
unit length for both sides is 


P= LR\) E (u/e€) (21) 


In this case the stored electric energy is half the value given by (15), and 
we find 


O = (Vu/e/R) (0/c) (22) 





Factors Affecting Magnetic Quality* 
By R. M. BOZORTH 


N THE preparation of magnetic materials for practical use it is impor- 


tant to know how to obtain products of the best quality and uniformity. 
In the scientific study of magnetism the goal is to understand the relation 
between the structure and composition on the one hand and the magnetic 
properties on the other. From both standpoints it is necessary to know the 
principal factors which influence magnetic behavior. These are briefly 
reviewed here. 

The properties depend on chemical composition, fabrication and heat 
treatment. Some properties, such as saturation magnetization, change only 
slowly with chemical composition and are usually unaffected by fabrication 
or heat treatment. On the contrary, permeability, coercive force and hystere 
sis loss are highly sensitive and show changes which are extreme among all 
the physical properties. Properties may thus be divided into s/ructure 
sensilive and siructure-insensilive groups. As an example, Fig. 1 shows mag 
netization curves of permalloy after it has been (a) cold rolled, (b) annealed 
and cooled slowly, and (c) annealed and cooled rapidly. The maximum 
permeability varies with the treatment over a range of about 20 fold, while 
the saturation induction is the same within a few per cent. Structure sensi 
tive properties such as permeability depend on small irregularities in atomic 
spacings, which have little effect on properties such as saturation induction. 

Some of the more common sensitive and insensitive properties are listed 
in Table I. The principal physical and chemical factors which affect these 
properties are listed in column 3. Their various effects will now be briefly 
discussed and_ illustrated. 


Phase Diagram 


Some of the most drastic changes in properties occur when the fabrication 
or heat treatment has brought about a change in structure of the material 
For this reason the phase diagram or constitutional diagram is of the ut 
most importance in relation to the preparation and properties of magnetic 
materials. As an example consider the phase diagram of the binary iron 
cobalt alloys of Fig. 2. Here the various areas show the phases, of different 


* This article is the substance of Chapter II of a book entitled “Ferromagnetism”’ 
be published early in 1951 by D. Van Nostrand Company, In 
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composition or structure, which are stable at the temperatures and com- 
positions indicated. The a phase has the body-centered-cubic crystal struc- 
ture characteristic of iron. At 910°C it transforms into the face-centered 
phase y, and at 1400° into the 6 phase, which has the same structure as the 
a phase. At about 400°C cobalt transforms, on heating, from the e phase 
(hexagonal structure) into the y phase. 
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Fig. 1—Effect of mechanical and heat treatment on the magnetization curve of 70 
permalloy (70% Ni, 30% Fe). 
TABLE [ 
Properties Commonly Sensitive or Insensitive to Small Changes in Structure, and Some of the 
Factors which Effect Such Changes 


Structure-Insensitive Properties aay ge - ye ee ig vane 
, Saturation Magnetization a, Permeability Composition (gross) 
Curie Point H, Coercive Force Impurities 
, Magnetostriction at Saturation |W» Hysteresis Loss Strain 

Crystal Anisotropy Constant Temperature 
Crystal Structure 
Crystal Orientation 


The dotted lines indicate the Curie point, at which the material becomes 
non-magnetic. 

In between the areas corresponding to the single phases a, y, 6 and e 
there are two-phase regions in which two crystal structures co-exist, some 
of the crystal grains having one structure and others the other. Such a two- 
phase structure is usually evident upon microscopic or X-ray examina- 
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Courtesy of E. E. Thomas. Magnification: (a) 50 times, (b) 200 times 





254 BELL SYSTEM TECHNICAL JOURNAL 


tion. Microphotographs of a single-phase alloy and a two-phase alloy of 
iron-cobalt-molybdenum are reproduced in Fig. 3 (a) and (b). 

The diagram of Fig. 2 shows several kinds of changes that affect the mag- 
netic properties. At (a) the material becomes non-magnetic on heating, 
without change in phase. At (b) there is a change of phase, both phases 
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Fig. 4—Effect of phase transformation of cobalt on magnetization with a constant 
field of 150 oersteds. Both phases magnetic. Masumoto. 
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Nig. 5--Phase transformation in iron-cobalt alloy (50% Co). High-temperature phase 
is non-magnetic 


heing magnetic. Figure + shows the changes in magnetic properties that 


occur during this latter transition; they are due partly to the high local 
strains that result from the change in structure, and partly to the difference 
in the crystal structures of the two phases. At (c) there is a change from a 
ferromagnetic to a non-magnetic phase, and Fig. 5 shows the rapid change 
in magnetization that occurs when the temperature rises in this area. At 
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(d) the a phase becomes ordered on cooling, i.e., the iron and cobalt atoms 
tend to distribute themselves regularly among the various atom positions 
so that each atom is surrounded by atoms of the other kind. This phenome- 
non is especially important in connection with the properties of iron-alumi 
num and manganese-nickel alloys. 

The transition at (e) is entirely in the non-magnetic region but it has 
its influence on the properties of iron at room temperature. If iron is cooled 
very slowly through (e), the internal strains caused by the change in struc- 
ture will be relieved by diffusion of the metal atoms, but if the cooling is too 
rapid there will not be sufficient time for strain relief. Practically this means 
that to obtain high permeability in iron it must be annealed for some time 
below 900°C, or cooled slowly through this temperature so that diffusion 
will have time to occur. In most ferromagnetic materials diffusion occurs 
at a reasonably rapid rate only at temperatures above about 500 to 600°C. 
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Fig. 6—Effect of tension on the magnetization curve of 68 permalloy 


The effect of a homogeneous s/rain on the magnetization curve can be 
observed in a simple way, as by applying tension to an annealed wire and 
then measuring B and H. The effect of tension on some materials is to 
increase the permeability and on other materials to decrease it, as shown 
in Fig. 6. Compression usually causes a change in the opposite sense 

The internal strains resulting from plastic deformation of the material, 
brought about by stressing beyond the elastic limit, as by pulling, rolling 
or drawing, almost always reduce the permeability. The material is then 
under rather severe local strains similar to those present after phase change, 
and these strains are different in magnitude and direction in different places 
in the material and have quite different values at points close together 
Strains of this kind can usually be relieved by annealing; therefore, metal 
that has been fabricated by plastic deformation is customarily annealed to 
raise its permeability. Figure 1 shows the effect of annealing a permalloy 
strip that has been cold-rolled to 15 per cent of its original thickness 
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The temperature also is effective in changing permeability and other prop- 
erties, even when no change in phase occurs. Figure 7 shows the rapidity 
with which the initial permeability decreases as the Curie point is ap- 
proached. For this material, Ferroxcube III, a zinc manganese ferrite 
(ZnMnFe,Ox), the Curie point is not far above room temperature. 

The effect of impurities may be illustrated by the B vs H curves for iron 
containing various amounts of carbon. Curve (a) of Fig. 8 is for a mild 


'w 
°o 
° 
o 





MEABILITY, JLo 


ra 
oO 








500 
20 40 60 80 100 120 
TEMPERATURE IN DEGREES CENTIGRADE 
Fig. 7—Variation of initial permeability of Ferroxcube 3, showing maximum at tem 
perature just below the Curie temperature. 
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steel having 0.2 per cent carbon, (b) is for the iron commonly used in elec- 
tromagnetic apparatus—it contains about 0.02 per cent carbon and is an- 
nealed at about 900°C. When this same iron is purified by heating for several 
hours at 1400°C in hydrogen, the carbon is reduced to less than 0.001 per 
cent and other impurities are removed, and curve (c) is obtained. 

Finally, Fig. 9 shows that large differences in permeability may be found 
by simply varying the direction of measurement of the magnetic properties 
in a single specimen. The material is a single crystal of iron containing about 
4 per cent silicon, and the directions in which the properties are measured 
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are [100] (parallel to one of the crystal axes), and [111] (as far removed as 
possible from an axis). The magnetic properties in the two directions are 
different because different “‘views” of the atomic arrangement are ob 
tained in the two directions. 


PRODUCTION OF MAGNETIC MATERIALS 


In the preparation of magnetic materials for either laboratory or commer 
cial use there are many processes which influence the chemical and.physical 
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Fig. 9—Dependence of permeability on crystallographic direction. Wéilliams 


structure of the product. The selection of raw materials, the melting and 
casting, the fabrication and the heat treatment, are all important and must 
be carried out with a proper knowledge of the metallurgy of the material. A 
brief description of the common practices is now given. For further dis- 
cussion the reader is referred to more detailed metallurgical books and ar- 
ticles. 
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Melting and Casting 


For experimental investigation of magnetic materials in the laboratory, 
the raw materials easily obtainable on the market are generally satisfactory. 
When high purity is desirable specially prepared materials and crucibles 
must be used and the atmosphere in contact with the melt must be con- 
trolled. The impurities that have the greatest influence on the magnetic 
properties of high permeability materials are the non-metallic elements, 
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Fig. 10—Induction furnace designed for small melts in controlled atmosphere, as de 
signed by J. H. Scaff and constructed by the Ajax Northrup Company. 


particularly oxygen, carbon and sulfur, and the presence of these impurities 
is therefore watched carefully and their analyses are carried out with special 
accuracy. Impurities are likely to change in important respects during the 
melting and pouring on account of reactions of the melt with the atmos- 
phere, the slag or the crucible lining, or because of reactions taking place 
among the constituents of the metal. 

Melting of small lots (10 pounds) is best carried out in a high-frequency 
induction furnace. Figure 10 shows such a furnace designed for melting ten 
to fifty pounds, and casting by tilting the furnace, the whole operation being 
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carried out in a controlled atmosphere. High-frequency currents (usually 
1,000 to 2,000 cycles/sec but sometimes much higher) are passed through 
the water-cooled copper coils, and the alternating magnetic field so produced 


Fig. 11—Arc furnace for large commercial melts. Courtesy of J. S. Marsh of the Bethle 


hem Steel Company 
heats the charge by inducing eddy currents in it. Crucibles are usually com- 


posed of alumina or magnesia. 
On a commercial scale melts of silicon-iron are usually made in the open 
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hearth furnace, in which pig-iron and scrap are refined and ferro-silicon 
added. The furnace capacity may be as large as 100 tons. Sometimes silicon- 
iron, and usually iron-nickel alloys, are melted in the arc furnace, in 
amounts varying from a few tons to 50 tons. A photograph of such a fur- 
nace, in the position of pouring, is shown in Fig. 11. The heat is produced 
in the arc drawn between large carbon electrodes immersed in the metal, 
the current sometimes rising to over 10,000 amperes. By tipping the fur- 
nace the melt is poured into a ladle, and from this it is poured into cast-iron 
molds through a valve-controlled hole in the ladle bottom. Special-purpose 
alloys, including permanent magnets, are prepared commercially in high- 


TABLE II 
Heats of Formation and Other Properties of Some Oxides (Sachs and Van Horn‘) 


f Heat of formation : f ‘ 
Oxide (Kilo-cal per gram Melting Point (°C) Density (g/cm?) 
atom of metal) 


CaO.. : i 152 > 2500 
BeO.. 144 > 2500 
MgO 144 2800 
Li,0.. 141 > 1700 
Al,O3 127 2050 
V202 116 1970 
TiO, 109 1640 
Na2,O 101 . 

SiO, 95 1670 
B.0; 94 580 
MnO 91 1650 
Zr( dy 8&9 2700 
ZnO 85 ™ 

POs 73 * 

SnO, 68 1130 
FeO 66 1420 
NiO 58 sh 


~ 


Coy 


SUD MUN DO De RW DWH WH WD 


eT OR 1 OOD ww 


* Sublimes. 
** Decomposes before melting 


frequency induction furnaces or in arc furnaces in quantities ranging from a 
fraction of a ton to several tons. 


Slags are commonly used when melting in air, both to protect from oxi- 
dation and to reduce the amounts of undesirable impurities. Common pro- 
tective coverings are mixtures of lime, magnesia, silica, fluorite, alumina, 
and borax in varying proportions. In commercial production different slags 
are used at different stages, to refine the melt; e.g., iron oxide may be used 
to decarburize and basic oxides to desulfurize. 

Melting in vacuum requires special technique that has been described in 
some detail by Yensen.! Commercial use has been described by Rohn? and 
others.* Melting in hydrogen has been used on an experimental scale in both 

'T. D. Yensen, Trans. A.I.E.F. 34, 2601-41 (1915). 


2 W. Rohn, Heraeus Vacuumschmelze, Albertis, Hanau, 356-80 (1933). 
3 W. Hessenbruch and K. Schichtel, Zeits. f. Metallkunde 36, 127-30 (1944). 
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high-frequency and resistance-wound furnaces. In commercial furnaces Rohn 
has used hydrogen and vacuum alternately before pouring, for purification 
in the melt, in low-frequency induction furnaces having capacities of several 
tons. 

Just before casting a melt of a high-permeability alloy such as iron nickel, 
a deoxidizer may be added, e.g. aluminum, magnesium, calcium or silicon, 
in an amount averaging around 0.1 per cent. The efficacy of a deoxidizer is 
measured by its heat of formation, and this is given for the common ele- 
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Fig. 12—Solubility of some gases in iron and nickel at various temperatures. Sieverts 
ments in Table II, taken from Sachs and Van Horn.‘ Also several tenths of a 
per cent of manganese may be put in to counteract the sulfur so that the 
material may be more readily worked; the manganese sulfide so formed col- 
lects into small globular masses which do not interfere seriously with the 
magnetic or mechanical properties of most materials. 

Ordinarily a quantity of gas is dissolved in molten metal, and this is likely 
to separate during solidification and cause unsound ingots. The solubilities 
of some gases in iron and nickel have been determined by Sieverts® and 
others and are given in Fig. 12, adapted from the compilation by Dushman.*® 
The characteristic decrease of solubility during freezing is apparent. Most 

4G. Sachs and K. R. Van Horn, Practical Metallurgy, Am. Soc. Metals, Cleveland 
(1940). 


6 A. Sieverts, Zeits. f. Metallkunde 21, 37-46 (1929). 
6S. Dushman, Vacuum Technique, Wiley, New York (1949 
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of the gases given off by magnetic metals during heating are formed from 
the impurities carbon, oxygen, nitrogen and sulfur; CO is usually given off 
in greatest amount from cast metal, and some N» and Hz are also found. 
Refining of the melt is therefore of obvious advantage, and the furnace of 
Fig. 10 is especially useful for this purpose. 

Small ingots are sometimes made by cooling in the crucible. Usually, 
however, ingots are poured into cast iron molds for subsequent reduction 
by rolling, etc.; permanent magnet or other materials are often cast in sand 
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Fig. 13——Design of rolls in a blooming mill for hot reduction of ingots to rod. Carnegte 
Illinois Steel Corp. 


in shapes which require only nominal amounts of machining or grinding 
for use in apparatus or in testing. Special techniques are used for specific 
materials. 


Other considerations important in the melting and pouring of ingots are 


proper mixing in the melt, the temperature of pouring, mold construction, 


inclusions of slag, segregation, shrinkage, cracks, blow holes, etc. 


Fabrication 
Magnetic materials require a wide variety of modes of fabrication, which 
can best be discussed in connection with the specific materials. The methods 


include hot and cold rolling, forging, swaging, drawing, pulverization, elec- 
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trodeposition, and numerous operations such as punching, pressing and 
spinning. In the commercial fabrication of ductile material it is common 
practice to start the reduction in a breakdown or blooming mill (Fig. 13) 
after heating the ingot to a high temperature (1200° to 1400°C). Large ingots, 
of several tons weight, are often led to the mill before they have cooled 
below the proper temperature. The reduction is continued as the metal 
cools, in a rod or flat rolling mill, depending on the desired form of the final 
product. When the thickness is decreased to 0.2 to 0.5 inch the material has 
usually cooled below the recrystallization temperature. Because of the dithi 


culty in handling hot sheets or rod of small thickness, they are rolled at or 


near room temperature, with intermediate annealings if necessary to soften 
or to develop the proper structure. In experimental work, rod is often 
swaged instead of rolled. 

In recent years the outstanding trends in methods of fabricating materials 
have been toward the construction of the multiple-roll rolling mill for roll 
ing thin strip, and the continuous strip mill for high-speed production on 
a large scale. Figure 14 shows the principle of construction of a typical 4-high 
mill ((a) and (b)), and of two special mills ({c) and (d)). In the 20-high 
Rohn’ mill and 12-high Sendzimir® mill the two working rolls are quite 
small (0.2 to one inch in diameter). These are each backed by two larger 
rolls and these in turn by others as indicated. In the Rohn mill (c), power is 
supplied to the two smallest rolls and the final bearing surfaces are at the 
ends of the largest rolls. In the Sendzimir mill (d) the power is supplied to 
the rolls of intermediate size and the bearing surfaces are distributed along 
the whole length of the largest rolls so that no appreciable bending of the 
rolls occurs. The small rolls reduce the thickness of thin stock with great 
efficiency, and the idling rolls permit the application of high pressure. 
In the Steckel mill power is used to pull the sheet through the rolls, which 
are usually 4+-high with small working rolls. 

The continuous strip mill is an arrangement of individual mills such that 
the strip is fed continuously from one to another and may be undergoing 
reduction in thickness in several mills simultaneously. Figure 15 shows a 
mill of this kind, used for cold reduction, with 6 individual! mills in tandem. 

For magnetic testing numerous forms of specimens are required for vari 
ous kinds of tests; these include strips for standard tests for transformer 
sheet, rings or parallelograms for conventional ballistic tests, “‘pancakes”’ 
of thin tape spirally wound for measurement by alternating current, ellip 
soids for high field measurements, and many others. The various forms are 


7W. Rohn, Heraeus Vacuumschmelze, Albertis, Hanau, 381-7 (1933 
8 T, Sendzimir, /ron and Steel Engr. 23, 53-9 (1946). 
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required to study or eliminate the effects of eddy-currents, demagnetizing 
fields and directional effects and to simulate the use of material in apparatus. 
Most of the needs arizing in commerce and in experimental investigation 
are filled by strips or sheets of thicknesses from 0.002 inch to 0.1 inch from 




















(b) 4-HIGH MILL, END VIEW 





























(Cc) 20-HIGH ROHN MILL (d) 12-HIGH SENDZIMIR MILL 


Fig. 14—Arrangement of rolls in mills used for reduction of thin sheet: (a) and (b) con- 
ventional 4-high mill; (c) Rohn 20-high; (d) Sendzimir 12-high. 


which coils can be wound or parts cut, by rods from which relay cores or 
other forms can be made, by powdered material used for pressing into cores 
for coils for inductive loading, and by castings for permanent magnets or 
other objects which may be machined or ground to final shape. 
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Heat-Treatment 


High permeability materials are annealed primarily to relieve the internal 
strains introduced during fabrication. On the contrary permanent magnet 
materials are heat-treated to introduce strains by precipitating a second 
phase. Heat-treatments are decidedly characteristic of the materials and 
their intended uses and are best discussed in detail in connection with them. 
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Fig. 16—Some common heat treatments for magnetic materials. 


Figure 16 shows some of the commonest treatments in the form of tempera- 
ture-time curves. The purpose of these various heating and cooling cycles, 
and typical materials subjected to them, may be listed as follows: 
(1) Relief of internal strains due to fabrication or phase-changes (furnace 
cool). Magnetic iron. 
(2) Increase of internal strains by precipitation hardening (air quench 
and bake). Alnico type of permanent magnets. 
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(3) Purification by contact with hydrogen or other gases. Silicon-iron 
(cold rolled), hydrogen-treated iron, Supermalloy. 
There are also special treatments, such as those used for “‘double-treated” 
permalloy, “‘magnetically annealed”? permalloy, and perminvar. 
Occasionally it is necessary to homogenize a material by maintaining the 
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temperature just below the freezing point for many hours. Heat-treatments 
also may affect grain size and crystal orientation. 

Furnaces for heat-treating have various designs that will not be considered 
here. A modern improvement has been the use of globar (silicon carbide) 
heating elements that permit treatment at 1300 to 1350°C in an atmosphere 
of hydrogen or air. 
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Further discussion of “Metallurgy and Magnetism” is given in an excel- 
lent small book of this title by Stanley.’ 
EFFECT OF COMPOSITION 
Gross Chemical Composition 


The effect of composition on magnetic properties will now be considered, 
using as examples the more important binary alloys of iron with silicon, 
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Fig. 18—-Variation of Bs and @ with the composition of iron-nickel alloys. 


nickel or cobalt, on which are based the most useful and interesting mate- 
rials. The iron-silicon alloys are used commercially without additions, the 
iron-nickel and iron-cobalt alloys are most useful in the ternary form; and 


many special alloys, for example material for permanent magnets, contain 


four or five components. 

igure 17 shows four important properties of the iron-silicon alloys of low 
silicon content, after they have been hot rolled and annealed. The commer- 
cial alloys (3 to 5% silicon) are the most useful because they have the best 


9 J. K. Stanley, Metallurgy and Magnetism, Am. Soc. Metals, Cleveland (1949). 
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combination of properties of various kinds. The properties shown in the 
figure are important in determining the best balance: the maximum per 
meability, um, only indirectly (it is a good measure of hysteresis loss and 
maximum field necessary in use), and the Curie point, @, only in a minor 
role. The saturation B,, permeability, and resistivity p, should all be as 
high as possible. B, , @ and p are structure insensitive, and vary with com 
position in a characteristically smooth way, practically independent of 
heat treatment; um depends on heat treatment (strain), impurities and 
crystal orientation. There are no phase changes to give sudden changes with 
composition of properties measured at room temperatures. 
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Fig. 19—-Variation of saturation magnetostriction, A,, and crystal anisotropy, A 
the composition of iron-nickel alloys. 


Some of the properties of the iron-nickel alloys are given in Figs. 18 and 
19, The change in phase from a@ to y at about 30 per cent nickel is responsible 
for the breaks at this composition. The permeabilities, uw) and u,, , (Fig. 20 
show characteristically the effect of heat treatment. The maxima are closely 
related to the points at which the saturation magnetostriction, A, , and crys 
tal anisotropy, A, pass through zero (Fig. 19). 

Additions of molybdenum, chromium, copper and other elements are 
made to enhance the desirable properties of the iron-nickel alloys 

The iron-cobalt alloys, some properties of which are shown in Fig. 21, are 
usually used when high inductions are advantageous. The unusual course of 
the saturation induction curve, with a maximum greater than that for any 
other material, is of obvious theoretical and practical importance. The su 
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Fig. 20—Dependence of the initial and maximum permeabilities (uo, um) of iron-nickel 
alloys on the heat treatment, 
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Fig. 21—Variation of B, and @ of iron-cobalt alloys with composition. 
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den changes in the Curie point curve are associated with a, y phase boun- 
daries, as mentioned earlier in this chapter. The peak of the permeability 
curve (Fig. 22) occurs at the composition for which atomic ordering is stable 
at the highest temperature (see also Fig. 2). The sharp decline near 95 per 
cent cobalt coincides with the phase change y,e at this composition. Addi 
tions of vanadium, chromium and other elements are used in making com- 
mercial ternary alloys. 

Some useful alloys based on the binary iron-silicon, iron-nickel and iron- 
cobalt alloys are described in Table ITI. 

The hardening of material resulting from the precipitation of one phase in 
another is often used to advantage when magnetic hardness (as in per 
manent magnets), or mechanical hardness, is desired. To illustrate this 
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Fig. 22—Variation of permeability at H = 10 oersteds, and of the critical temperature 
of ordering, with the composition of iron-cobalt alloys 


process consider the binary iron molybdenum alloys, a partial phase dia 
gram of which is given in Fig. 23. The effect of the boundary between the 
a and a + e¢ fields is shown by the variation of the properties with composi 
tion (Fig. 24a). Saturation magnetization and Curie point are affected but 
little, the principle change in the former being a slight change in the slope 
of the curve at the composition at which the phase boundary crosses 500° 

the temperature below which diffusion is very slow. The Curie point curve 
has an almost imperceptible break at the composition at which the phase 
boundary lies at the Curie temperature. The changes of maximum per 
meability and coercive force are more drastic; 4m drops rapidly as the amount 
of the second phase, ¢€, increases and produces more and more internal strain 
(Fig. 24b), and H, increases at the same time. The experimental points 
correspond to a moderate rate of cooling of the alloy after annealing. 
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Fig. 23—Phase diagram of iron-rich iron-molybdenum alloys, showing solid solubility 
curve important in the precipitation-hardening process 
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Fig. 24— Change of structure-insensitive properties (@ and B,) and structure-sensitive 
properties (um and H.) with the composition when precipitation-hardening occurs 





274 BELL SYSTEM TECHNICAL JOURNAL 


When the amount of the second phase is considerable (as in the 15% Mo 
alloy) it is common practice to quench the alloy from a temperature at which 
it is a single phase (e.g. 1100 or 1200°C) and so maintain it temporarily as 
such, and then to heat it to a temperature (e.g., 600°C) at which diffusion 
proceeds at a more practical rate. During the latter step the second phase 
separates slowly enough so that it can easily be stopped at the optimum 
point, after a sufficient amount has been precipitated but before diffusion 
has been permitted to relieve the strains caused by the precipitation. A 
conventional heat treatment for precipitation-hardening of this kind, used 
on many permanent magnet materials, has already been given in Fig. 16. 

In some respects the development of atomic order in a structure is like 
the precipitation of a second phase. When small portions of the material 
become ordered and neighboring regions are still disordered, severe local 
strains may be set up in the same way that they are during the precipitation 
hardening described above. The treatment used to establish high strains is 
the same as in the more conventional precipitation hardening. The decom- 


position of an ordered structure in the iron-nickel-aluminum system has 


been held responsible, by Bradley and Taylor,’ for the good permanent 
magnet qualities of these alloys. 

Some of the common permanent magnets, heat treated to develop in- 
ternal strains by precipitation of a second phase, or by the development of 
atomic ordering, are described in Table IV. 

The changes in properties to be expected when the composition varies 
across a phase boundary of a binary system are shown schematically by the 


curves of Fig. 25. 


Impurities 

The principle of precipitation hardening, as just described, applies also 
to the lowering of permeability by the presence of accidental impurities. 
For example, the solubilities of carbon, oxygen and nitrogen in iron, de- 
scribed by the curves of Fig. 26, are quite similar in form to the curve sep- 
arating the a and a + € areas of the iron-molybdenum system of Fig. 23; 
the chief difference is that the scale of composition now corresponds to con- 
centrations usually described as impurities. One expects, then, that the 
presence of more than 0.04 per cent of carbon in iron will cause the perme- 
ability of an annealed specimen to be considerably below that of pure iron. 
The amount of carbon present in solid solution will also affect the magnetic 
properties. 

Because the amounts of material involved are small, it is difficult to carry 
out well defined experiments on the effects of each impurity, especially in 


10 A, J. Bradley and A. Taylor, Proc. Roy. Soc. (London) 166, 353-75 (1938). 
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Fig. 25—Diagrams illustrating the changes in various properties that occur when a 
second phase precipitates. 


the absence of disturbing amounts of other impurities. Two examples of the 
effect of impurities will be given, in addition to Fig. 8. In Fig. 27 Yensen and 
Ziegler"! have plotted the hysteresis loss as dependent on carbon content, 


"T. D. Yensen and N. A. Ziegler, Trans. Am. Soc. Metals 24, 337-58 (1936). 
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the curve giving the mean values of many determinations. The hysteresis 
decreases rapidly at small carbon contents, when these are of the order of 
magnitude of the solid solubility at room temperature. 
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Fig. 26—Approximate solubility curves of carbon, oxygen and nitrogen in 
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Fig. 27—Effect of carbon content on hysteresis in iron. Yensen and Ziegler. 


Cioffi? has purified iron from carbon, oxygen, nitrogen and sulfur by 


heating in pure hydrogen at 1475°C, and has measured the permeability 


2P. P. Ciofhi, Phys. Rev. 39, 363-7 (1932 
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at different stages of purification. Table V shows that impurities of a few 
thousandths of a per cent are quite effective in depressing the maximum 
permeability of iron. 

Carbon and nitrogen, present as impurities, are known to cause ‘“‘aging”’ 
in iron—that is, the permeability and coercive force of iron containing these 
elements as impurities will change gradually with time when maintained 
somewhat above room temperature. As an example, a specimen of iron was 
maintained for 100 hours first at 100°C, then 150°C, then 100°C, and so on. 


TABLE V 
Maximum permeability of Armco iron with different degrees of purification, effected by heat 
treatment in pure hydrogen at 1475°C for the times indicated (P. P. Cioffi). 
Analyses from R. F. Mehl (private communication to P. P. Cioffi). 


‘ons — Composition in Per Cent 
Time of Treatment 

in Hours : 
ie S oO N Mn  g 


7000 | 0.012 0.018 0.030 | 0.0018 | 0.030 | 0.004 
16000 005 010 003 0004 . as 
30000 | .005 006 003 .0003 . 
70000 | .003 | - .003 .0001 | — 


227000 .005 < .003 003 | -0001 | .028 


Precision of analysis .001 .002 002 | .0001 


The corresponding changes in coercive force are given in the diagram of 
Fig. 28. A change of about 2-fold is observed. 


SoME IMPORTANT PHYSICAL PROPERTIES 


There are many physical characteristics that are important in the study 
of ferromagnetism from both the practical and the theoretical point of view. 
These include the resistivity, density, atomic diameter, specific heat, ex- 
pansion, hardness, elastic limit, plasticity, toughness, mechanical damping, 
specimen dimensions, and numerous others. In a different category may be 
mentioned corrosion, homogeneity and porosity. Most of these properties 
are best discussed in connection with specific materials or properties; only 
the most important characteristics will be mentioned here. A table of the 
atomic weights and numbers, densities, melting points, resistivities and 
coefficients of thermal expansion of the metallic elements, is readily avail- 
able in the Metals Handbook. 

Dissolving a small amount of one element in another increases the re- 
sistivity of the latter. To show the relative effects of various elements, the 
common binary alloys of iron and of nickel are shown in Figs. 29 and 30. 
}'rom a theoretical standpoint it is desirable to understand (1) the relatively 
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high resistivity of the ferromagnetic elements compared to their neighbors 

in the periodic table and (2) the relative amounts by which the resistivity 

of iron (or cobalt or nickel) is raised by a given atomic percentage of vari- 

ous other elements. From a practical standpoint, a high resistivity is usually 
AGING TEMPERATURE IN DEGREES CENTIGRADE 
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Fig. 28—Effect of nitrogen impurity on the coercive force of iron annealed successively 
at 100 and 150°C. 
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Fig. 29 Dependence of resistivity on the addition of small amounts of 1 


to iron. 


desirable in order to decrease the eddy-current losses in the material, and 
so decrease the power wasted and the lag in time between the cause and 
effect, for example, the time lag of operation of a relay. 


Knowledge of the afomic diameter is important in considering the effects 
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Fig. 30—Resistivity of various alloys of nickel. 
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of alloying elements, and values for the metallic and borderline elements are 
shown in Fig. 31. Most of the values are simply the distances of nearest 
approach of atoms in the element as it exists in the structure stable at room 
temperature. Atomic diameter is especially important in theory because the 
very existence of ferromagnetism is dependent in a critical way on the dis 
tance between adjacent atoms. This has been discussed more fully in a 
previous paper." 

Even when no phase change occurs in a metal, important changes in siruc- 
fure occur during fabrication and heat treatment, and these are compli- 
cated and imperfectly understood. When a single crystal is elongated by 
tension, slip occurs on a limited number of crystal planes that in general 
are inclined to the axis of tension. As elongation proceeds, the planes on 
which slip is taking place tend to turn so that they are less inclined to the 
axis. In this way a definite crystallographic direction approaches parallelism 


Pi. 
tt] ¥\ 


\ 


(a) ROLLED (b) RECRYSTALLIZED (Cc) DRAWN 


Fig. 32—The preferred orientations of crystals in nickel sheet and wire after fabrication 
and after recrystallization 


with the length of the specimen. In a similar but more complicated way, 
any of the usual methods of fabrication cause the many crystals of which it 
is composed to assume a non-random distribution of orientations, often 
referred to as preferred or special orientations, or textures. Some of the tex 
tures reported for cold rolled and cold drawn magnetic materials are given 
in Table VI, taken from the compilation by Barrett.'* The orientations of 
the cubes which are the crystallographic units are shown in Fig. 32 (a) and 


(c) for cold rolled sheets and cold drawn wires of nickel. 


Since the magnetic properties of single crystals depend on « rystallographic 


direction (anisotropy), the properties of polycrystalline materials in which 
there is special orientation will also be direction-dependent. In fact it is 
difficult to achieve isotropy in any fabricated material, even if fabrication 
involves no more than solidifying from the melt. The relief of the internal 


'9R. M. Bozorth, Bell Sys. Tech. Jl. 19, 1-39 (1940 
4C.S. Barrett, Structure of Metals, McGraw Hill, New York (1943 
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strains in a fabricated metal by annealing proceeds only slowly at low 
temperatures (up to 600°C for most ferrous metals) without noticeable grain 
growth or change in grain orientation, and is designated recovery. The prin- 
ciple change is a reduction in the amplitude of internal strains, and this can 
be followed quantitatively by X-ray measurements. Near the point of com- 
plete relief distinct changes occur in both grain size and grain orientation, 
and the material is said /o recrystallize. At higher temperatures grain growth 
increases more rapidly. The specific temperatures necessary for both re- 
covery and recrystallization depend on the amount of previous deformation, 
as shown in Fig. 33. Special orientations are also present in fabricated mate- 
rials after recrystallization, and some of these are listed in Table VI and illus- 
trated for nickel in Fig. 32 (b). 

As an example of the dependence of various magnetic properties on direc- 
tion, Fig. 34 gives data of Dahl and Pawlek! for a 40 per cent nickel iron 


TABLE VI 
Preferred Orientations in Drawn Wires and Rolled Sheets, Before and After Recrystalliza- 
tion, and in Castings (Barrett'*) 


The rolling plane and rolling direction, or wire axis, or direction of growth, are designated 


Drawn wires Rolled Sheets 
Metal Crystal As 
al re 2 
Structure Recrys 


sD " 
As Drawn tallized 


As Rolled Recrystallized 


[ron. i {110} [110] | (001), [110] and | (001), 15° to [100] 
others {110} 
Cobalt.. (001) 
Nickel {111] and (110), [112] and (100), [001] 
{100} others 


alloy reduced 98.5 per cent in area by cold rolling and then annealed at 
1100°C. After further cold rolling (50 per cent reduction) the properties 
are as described in Fig. 35. 

The mechanical properties ordinarily desirable in practical materials are 
those which facilitate fabrication. Mild steel is often considered as the 
nearest approach to an ideal material in this respect. Silicon iron is limited 


by its brittleness, which becomes of major importance at about 5 per cent 


silicon; this is shown by the curve of Fig. 36. Permalloy is ‘tougher’ than 
iron or mild steel and requires more power in rolling and more frequent 
annealing between passes when cold-rolled, but can be cold-worked to smaller 
dimensions. If materials have insufficient stiffness or hardness, parts of 
apparatus made from them must be handled with care to avoid bending 
and consequent lowering of the permeability. If the hardness is too great 
the material must be ground to size. This is the case with some permanent 
magnets. 


18Q, Dahl and F. Pawlek, Zeits. f. Metallkunde 28, 230-3 (1936). 
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Fig. 33—Dependence of the grain size of iron on the amount of deformation and on the 
temperature of anneal. Aenyon 
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Fig. 34—Variation of magnetic properties with the direction of measureme! 
of iron-nickel alloy (40% Ni) severely rolled (98.5%) and annealed at 1100°C 


The effect of size of a magnetic specimen is often of importance. This is 
well known in the study of thin films, and fine powders in which the smallest 





284 BELL SYSTEM TECHNICAL JOURNAL 


dimension is about 10~* cm or less. Many studies have been made of thin 
electrodeposited and evaporated films. Generally it is found that the per- 
meability is low and the coercive force high. The interpretation is uncertain 
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Fig. 35—Properties of the same material as that of Fig. 34, after it has been rolled 
annealed, and again rolled 





100,000 


80,000 











60,000 


40,000 


-RENGTH IN LB/SQ IN 





7 














a 





TENSILE S 


2 3 a 2 6 
PER CENT SILICON IN IRON 


Fig. 36—Variation of the breaking strength of iron-silicon alloys, showing the onset of 
brittleness near 4 per cent silicon. 


because it is difficult to separate the effects of strains and air gaps from the 
intrinsic effect of thickness, though it is known that each one of these vari- 
ables has a definite effect. As one example of the many experiments, we 
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will show here the effect of the thickness of electrodeposited films of cobalt. 
Magnetization curves are shown in Fig. 37 according to previously un- 
published work of the author. 
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Fig. 37 —Dependence of the magnetization curves of pure electrodeposited cobalt films 
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Fig. 38 Depe ndence of coercive force on the particle size of Mn Bi powder. Guillaud 


The high coercive force obtained in fine powders by Guillaud'® is one of 
the most clear cut examples of the intrinsic effect of particle size. The coer 
cive force increases by a factor of 15 as the size decreases to 5 K 10°4 cm 
(Fig. 38). 


16C, Guillaud, Thesis, Strasbourg (1943). 





BELL SYSTEM TECHNICAL JOURNAL 


Properties Affected by Magnetization 
In addition to the magnetization, other properties are changed by the 
direct application of a magnetic field. Some of these, and the amounts by 
which they may be changed, are as follows: 
Length and volume (magnetostriction) (0.01%) 
Electrical resistivity (5%) 
Temperature (magnetocaloric effect; heat of hysteresis) (1°C) 
Elastic constants (20 per cent) 
Rotation of plane of polarization of light (Kerr and Faraday 
effects) (one degree of arc) 
In addition to these properties there are others that change with tem- 


perature because the magnetization itself changes. Thus there is “‘anoma- 


lous”’ temperature-dependence of: 
Specific heat 
Thermal expansion 
Electrical resistivity 
Elastic constants 
Thermoelectric force 
and of other properties below the Curie point of a ferromagnetic material, 
even when no magnetic field is applied. 
Also associated with ferromagnetism are galvanomagnetic, chemical and 
other effects. 





Technical Articles by Bell System Authors Not Appearing 
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Measurement Method for Picture Tubes. M. W. BAtpwin.' Electronics, 
Mikey pp. 104 105, Nov., 1949. 

Diffusion in Binary Alloys. J. BARDEEN.’ Phys. Rev., V. 76, pp. 1403 
1405, Nov. 1, 1949. 

AsstrAct—Darken has given a phenomenological theory of diffusion in 
binary alloys based on the assumption that each constituent diffuses inde 
pendently relative to a fixed reference frame. It is shown that diffusion via 
vacant lattice sites leads to Darken’s equations if it is assumed that the 
concentration of vacant sites is in thermal equilibrium. Grain boundaries 
and dislocations may act as sources and sinks for vacant sites and act to 
maintain equilibrium. The modifications required in the equations if the 
vacant sites are not in equilibrium are discussed. 

Variable Phase-Shift Frequency-Modulated Oscillator. O. E. pe LANGE. 
I.R.E., Proc., V. 37, pp. 1328-1331, Nov., 1949. 

Asstract—The theory of operation of a phase-shift type of oscillator is 
discussed briefly. This oscillator consists of a broad-band amplitier, the out 


put of which is fed back to the input through an electronic phase-shiftin 


£ 
circuit. The instantaneous frequency is controlled by the phase shift through 
this latter circuit. True FM is obtained in that frequency deviation is 
directly proportional to the instantaneous amplitude of the modulating sig 
nal and substantially independent of modulation frequency 

A practical oscillator using this circuit at 65 me is described. 

Erosion of Electrical Contacts on Make.t L. H. Germer' and F. E. Ha 
worTH.' Jl. Applied Phys., V. 20, pp. 1085-1108, Nov., 1949. 

Apstract-—When an electric current is established by bringing two elec- 
trodes together, they necessarily discharge a capacity. Unless the current 
which is set up is above 1 ampere, the erosion which is produced in a low 
voltage circuit is appreciable only when the capacity is of appreciable size 
and when it is discharged very rapidly by an arc. When the are occurs, its 
energy is dissipated almost entirely upon the positive electrode and, when 
the circuit inductance is sufficiently low, melts out a crater intermediate in 
volume between the volume of metal which can be melted by the energy 


t A reprint of this article may be obtained on request to the editor 


‘BOE.L 
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and that which can be boiled. Some of the melted metal lands on the nega- 
tive electrode and, with repetition of the phenomenon, results in a mound 
of metal transferred from the anode to the cathode. This transfer, which is 
about 4X 10°" cc of metal per erg, is the erosion which occurs on the 
make of electrical contacts. 

The arc voltage is of the order of 15. If the initial circuit potential is 
more than about 50 volts, there may be more than one arc discharge, suc- 
cessive discharges being in opposite directions and resulting in the transfer 
of metal in opposite directions—always to the electrode which is negative. 

The occurrence of an arc is dependent upon the condition of the electrode 
surfaces and upon the circuit inductance. For “inactive” surfaces an arc 
does not occur for inductances greater than about 3 microhenries. Platinum 


‘ 


surfaces can be “‘activated” by various organic vapors, and in the active 
condition they give arcs even when the circuit inductance is greater than 
this limiting value by a factor of 10%. 

The Conductivity of Silicon and Germanium as A ffected by Chemically In- 
troduced Impurities. G. L. PEARSON.' Paper presented at A. I. E. E., Swamps- 
cott, Mass., June 20-24, 1949. Included in compilation on semiconductors. 
Elec. Engg., V. 68, pp. 1047-1056, Dec. 1949. 

Axsstract—Silicon and germanium are semiconductors whose electrical 
properties are highly dependent upon the amount of impurities present. 


For example, the intrinsic conductivity of pure silicon at room temperature 
is 4 X 10-° (ohm cm) and the addition of one boron atom for each million 
silicon atoms increases this to 0.8 (ohm cm)~', a factor of 2 & 10°. 


Although such impurity concentrations are too weak to be detected by 
standard chemical analysis, the use of radioactive tracers and the Hall 
effect has made it possible to make quantitative measurements at impurity 
concentrations as small as one part in 5 X 10°. 

Silicon and germanium are elements of the fourth group of the periodic 
table with the same crystal structure as diamonds and they have respec- 
tively 5.2 & 10” and 4.5 X 10” atoms per cubic centimeter. The addition 
of impurity elements of the third group such as boron or aluminum gives 
defect or p-type conductivity. Elements from the fifth group such as phos- 
phorous, antimony or arsenic give excess or n-type conductivity. 

The conductivity at room temperature, where it has been shown that 
each impurity atom contributes one conduction charge, is given by equa- 
tion (1) where N is the number of solute atoms per cubic centimeter. 


A + BN. (1) 
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The constants A and B for the various alloys investigated are given in the 
following table: 


Alloy 


Si + B +x 10° 
Si + P 4 xX 10° 
Ge + Sb se mre 


Equation (1) applies to solute atom concentrations as high as 5 X 10'" 
per cc. At higher concentrations the mobilities are lowered due to increased 
impurity scattering so that the computed conduction is higher than the 
measured. 

Microstructures of Silicon Ingots.t W. G. Prann' and J. H. Scarr.' 
Metals Trans., V. 185 (Jl. Metals, V. 1) pp. 389-392, June, 1949. 

Increasing S pace-Charge Waves.t J. R. Prerce.' Jl. Applied Phys., V.. 20, 
pp. 1060-1066, Nov. 1949. 

ABSTRACT—An earlier paper presented equations for increasing waves in 
the presence of two streams of charged particles having different velocities, 
and solved the equations assuming the velocity of one group of particles to 
be zero or small. Numerical solutions giving the rate of increase and the 
phase velocity of the increasing wave for a wide range of parameters, cover- 
ing cases of ion oscillation and double-stream amplification, are presented 
here. 

Traveling-Wave Oscilloscope. J. R. Pierce.’ Electronics, V. 22, pp. 97-99, 
Nov., 1949. 

Agpstract—This paper describes a 1,000 volt oscilloscope tube with a 
traveling-wave detlectihg system. The tube is suitable for viewing periodic 
signals with frequencies up to 500 me. A signal of 0.037 volt into 75 ohms 
deflects the spot one spot diameter. A few milliwatts input gives a good 


pattern, so that the tube can be used without an amplifier. The pattern is 


viewed through a sixty power microscope. 

P-type and N-type Silicon and the Formation of Photovoliaic Barrier in 
Silicon Ingots.t J. H. Scarr,! H. C. Toeurerer' and E. E. ScoumMacuer.! 
Metals Trans., V. 185 (Jl. Metals, V. 1) pp. 383-388, Jan., 1949. 

Longitudinal Noise in Audio Circuits. Ho. W. AuGcustapt' and W. PF. 
KANNENBERG.' Audio Engg., V. 34, pp. 22-24, 45, Jan., 1950. 

Transistors. J. A. BecKER.' Compilation of three papers presented at 
A. I. E. E. meeting Swampscott, Mass., June 20-24, 1949. Elec. Engg., 
V. 69, pp. 58-64, Jan., 1950. 
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A pplication of Thermistors to Control Networks.t J. H. BoLuMAn! and 
J. G. Kreer.' J. R. E., Proc., V. 38, pp. 20-26, Jan., 1950. 

AsstrAcTt—In connection with the application of thermistors to regulat- 
ing and indicating systems, there have been derived several relations be- 
tween current, voltage, resistance, and power which determine the electrical 
behavior of the thermistor from its various thermal and physical constants. 
The complete differential equation describing the time behavior of a di- 
rectly heated thermistor has been developed in a form which may be solved 
by methods appropriate to the problem. 

Sensitive Magnetometer for Very Small Areas.¥ D. M. Crapin.' Rev. Sct. 
Instruments, V. 20, pp. 945-946, Dec., 1949. 

Axsstract—A vibrating wire system for measuring weak magnetic fields 
is described for use in very small spaces. Quartz crystals are used for drivers 
to get sufficient velocity with very small displacements. To adjust the 
driving voltage to correspond exactly to the natural crystal frequency, the 
crystal is also used to regulate the oscillator. 

Method of Calculating Hearing Loss for Speech from an Audiogram.t H. 
FLETCHER.' Acoustical Soc. Am., Jl., V. 22, pp. 1-5, Jan., 1950. 

Asstract—The question frequently arises, Can one compute the hearing 
loss of speech from the audiogram and thus make it unnecessary to make a 
speech test after the hearing loss for several frequencies has been recorded. 
This paper shows that this can be done by taking a weighted average of the 
exponentials of the hearing loss at each frequency. Or if , is the hearing 
loss for speech and f; the hearing loss at each frequency, 


10%" = fG 100 "df 


The weighting factor G was determined by Fletcher and Galt from thresh- 
old measurements of speech coming from filter systems. As specifically 
applied to the case of hearing loss at the five frequencies 250, 500, 1000, 
2000 and 4000 cps, the above equation is approximately equivalent to 


B 9/10 


B. = —10 log [.01 X 10°" + 13 X 10 
+ 40 x 107% 4. 38 x 107%) 4 08 x 10 


(85 /10)1 
| 


where (; is hearing loss at 250 cps 
82 is hearing loss at 500 cps 
8; is hearing loss at 1000 cps 
84 is hearing loss at 2000 cps 
3; is hearing loss at 4000 cps 
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Designing for Air Purity. A. M. HANFMANN.? Heating & Ventilating, V. 
47, pp. 59-04, Jan., 1950. 

Reciprocity Pressure Response Formula Which Includes the Effect of the 
Chamber Load on the Motion of the Transducer Diaphragms.t M.S. Haw.ey.' 
Acoustical Soc. Am., Jl., V. 22, pp. 56-58, Jan., 1950. 

AsstTrAct—In order to reduce the effects of wave motion in the coupling 
chamber to permit reciprocity pressure response measurements to higher 
frequencies, only two of the three transducers involved are coupled at a 
time to the chamber. Given for these conditions is a derivation of the pres- 
sure response formula which includes the effect of the chamber load on the 
motion of the transducer diaphragms. 

Theory of the ‘‘Forbidden” (222) Electron Reflection in the Diamond Siruc- 
ture.t| R. D. Hemenretcu.! Phys. Rev., V. 77, pp. 271-283, Jan. 15, 1950. 

Apstract—The dynamical or wave mechanical theory of electron diffrac- 
tion is extended to include several diffracted beams. In the Brillouin zone 
scheme this is equivalent to terminating the incident crystal wave vector 
at or near a zone edge or corner. The problem is then one of determining the 
energy levels and wave functions in the neighborhood of a corner. The solu- 
tion of the Schrédinger equation near a zone corner is a linear combination 
of Bloch functions in which the wave vectors are determined by the boundary 
conditions and the requirement that the total energy be fixed. This leads to 
a multiplicity of wave vectors for each diffracted beam giving rise to inter- 
ference phenomena and is an essential feature of the dynamical theory. 

At a Brillouin zone edge formed by boundaries associated with reciprocal 
lattice points S and O the orthogonality of the unperturbed wave functions 
in conjunction with the periodic potential requires that another recipro- 
cal lattice point \ be included in the calculation. The indices of \ must be 
such that (AiA2A3) = (SiS283) — (gigegs). The perturbation at the zone edge 
results in non-zero amplitude coefficients Cg, Cs and Cj for the diffracted 


waves irrespective of whether or not the structure factor for \ ,s or g van- 
ishes. This is the basis of the explanation of the (222) reflection and since it 


oer ne 


arises through perturbation at a Brillouin zone edge or corner the term 


pet 


“perturbation reflection” is advanced to replace the commonly used ‘‘for 
bidden reflection.” 


ee er AMR A md 


The octahedron formed by the (222) Brillouin zone boundaries exhibits 


an array of lines due to intersections with other boundaries to form edges. 


This array of lines is called a “‘perturbation grid” and the condition for the 

occurrence of a (222) reflection is simply that the incident wave vector 

terminate on or near a grid line. Numerical intensity calculations are pre- 
+ A reprint of this article may be obtained on request to the editor of the B.S.T.J 
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sented which show that a strong (222) can be accounted for by the dynamical 
theory. 

An impedance network model is briefly discussed which may aid in quali- 
tative considerations of the dynamical theory for the case of several 
diffracted waves. 

Determination of g-Values in Paramagnetic Organic Compounds by Micro- 
wave Resonance. A. N. HoLpeN,!' C. Kitret,' F. R. Merritt! and W. A. 
YAGER.! Letter to the Editor, Phys. Rev., V. 77, pp. 146-147, Jan. 1, 1950. 

Vonlinear Coil Generators of Short Pulses.t L. W. Hussey.' [.R.E., Proc., 
V. 38, pp. 40-44, Jan., 1950. 

ABSTRACT—Small permalloy coils and circuits have been developed which 
produce pulses well below a tenth of a microsecond in duration with repeti- 
tion rates up to a few megacycles. 

The construction of these coils is described. Low power circuits are di- 
cussed suitable for different types of drive and different frequency ranges. 

Subjective Effects in Binaural Hearing. W. Korentc.' Letter to the Editor, 
Acoustical Soc. Am., Jl., V. 22, pp. 61-62, Jan., 1950. 

ABsTRACT—-Experiments witha binaural telephone system disclosed some 
remarkable properties, notably its ability to “squelch” reverberation and 
background noises, as compared to a system having only one pickup. No 
explanation has been found for this subjective effect. It was also discovered 
that a well-known defect in the directional discrimination of binaural sys- 
tems was remedied by a mechanical arrangement which rotated the pickup 
microphores as the listener turned his head. 

Corrosion Testing of Buried Cables. T. J. MairLanp.’ Corrosion, V. 6, pp. 
1-8, Jan., 1950. 

fOAC1 Carrier Telegraph System. A. L. MaAtte.' Tel. & Tel. Age, No. 2, 
pp. 7-9, Feb., 1950. 

Giving New Life to Old Equipment. P. H. Mieve.’ Bell Tel. Mag., V. 28, 
pp. 154-163, Autumn, 1949, 

Thermionic Emission of Thin Films of Alkaline Earth Oxide Deposited by 
Evaporation.t G. E. Moore! and H. W. Atitson.' Phys. Rev., V. 77, pp. 
246-257, Jan. 15, 1950. 


Asstrract--Monomolecular films of BaO or SrO were deposited by evap- 


oration on clean tungsten or molybdenum surfaces with precautions to elimi- 


nate effects caused by excess metal of the oxide or by heating. Thermionic 

emissions of the same order of magnitude as from commercial oxide cathodes 

have been obtained from these systems. The results can be explained quali- 

tatively by considering the adsorbed molecules as oriented dipoles. Although 
t A reprint of this article may be obtained on request to the editor of the B.S.T.J. 


'B.T.L 
A. T. & T, 





ARTICLES BY BELL SYSTEM AUTHORS 293 


the results may suggest a possible mechanism for a portion of the emission 
from thick oxide cathodes, there exist serious obstacles to such thin film 
phenomena as a complete explanation. 

Long Distance Finds the Way. W. H. NuNN» Bell Tel. Mag., V. 28, pp. 
137-147, Autumn, 1949. 

Private Line Services for the Aviation Industry. H. V. Roumrort.’ Bell 
Tel. Mag., V. 28, pp. 165-174, Autumn, 1949. 

Growing and Processing of Single Crystals of Magnetic Metals.t J. G. 
WaLkER,! H. J. Writtams' and R. M. Bozortu.' Rev. Sct. Jnstruments, 
V. 20, pp. 947-950, Dec., 1949. 

ABSTRACT—Single crystals of nickel, cobalt and various alloys are grown 
by slow cooling of the melt. They are oriented by optical means and by 
X-rays, and ground to the desired shape using the technique described 

A Look Around—and Ahead. L. A. Witson.* Bell Tel. Mag., V. 28, pp. 
133-136, Autumn, 1949. 


7 A reprint of this article may be obtained on request to the editor of the B.S.T.J 
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